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Date: October 18, 2000 
Docket No.: 1422-0442P 



Assistant Commissioner for Patents 
Washington, DC 20231 

Sir : 

This is a Request for filing a □ continuation ^ divisional 
□ continuation-in-part application under 37 C.F.R. § 1.53(b) of 
pending prior Application No. 08/952,089 filed on November 10, 
1997, the entire contents of which are hereby incorporated by 
reference, 
by 

SAGAWA, Hiroaki; UENO, Harumi; OSHIMA, At sushi; and KATO, 
Ikunoshin 

for 

PLASMID 



IE! Enclosed is an application consisting of specification, 
claims, declaration and drawings/photographs (if 
applicable) . 

El The filing fee has been calculated as follows: 
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LARGE ENTITY 


SMALL ENTITY 




BASIC FEE 


$710.00 


$355.00 




NUMBER 
FILED 


NUMBER 
EXTRA 


RATE FEE 


RATE FEE 


TOTAL 
CLAIMS 


9-20 = 


0 


X 18 = $0.00 


X 9 = $0 . 00 


INDEPENDENT 
CLAIMS 


6-3 = 


3 


X 80 = 
$240.00 


X 40 = $0.00 


□ MULTIPLE DEPENDENT 
CLAIMS PRESENTED 


+ $270.00 


+ $135.00 




TOTAL 


$950.00 


$0.00 



3. 1^ A check in the amount of $950.00 to cover the filing 

fee and recording fee (if applicable) is enclosed. 

4. □ Please charge Deposit Account No. 02-2448 in the amount 

of $0.00. A triplicate copy of this request is 
enclosed . 

5. Amend the specification by inserting before the first line 
thereof the following: : 

a. □ — This application is a □ continuation 

□ divisional □ continuation-in-part of co-pending 
Application No. 08/952,089, filed on November 10, 
1997, the entire contents of which are hereby 
incorporated by reference. — 

b. 1^ — This application is a □ continuation 

1^ divisional □ continuation-in-part of co-pending 
Application No. 08/952,089, filed on November 10, 
1997. Application No. 08/952,089 is the national 
phase of PCT International Application No. 
PCT/ JP97 / 00748 filed on March 10, 1997 under 35 
U.S.C. § 371. The entire contents of each of the 
above-identified applications are hereby incorporated 
by reference. — 

6. Enclosed is/are twenty-six (26) sheet (s) of formal 
drawings and/or photographs. 

7 . □ A statement claiming small entity status was filed in 

prior Application No. 08/952, 089 on . See the 

attached copy of the statement claiming small entity 
status . 
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8. ^ The prior application is assigned to Takara Shuzo Co., 

Ltd. . 

9. ^ A Preliminary Amendment is enclosed. 

10a. □ Priority of Application No(s). filed in 



is/are claimed under 35 U.S.C. § 119. See 

attached copy of the Letter claiming priority filed in 
the prior application on . 

10b. ^ Priority of International Appln . PCT/ JP97 /007 4 8 filed 
on March 10, 1997 under the Patent Cooperation Treaty 
and Japanese Application No. 8-85801 and 8-208897 filed 
in Japan on March 13, 1996 and July 18, 1996, 
respectively under 35 U.S.C. § 119 are hereby 
reclaimed . 

11. Information Disclosure Statement and PTO-1449 
forra(s) are attached hereto for the Examiner's 
consideration . 

12. 1^ Address all future communications to: 

BIRCH, STEWART, KOLASCH & BIRCH, LLP 
P.O. Box 747 

Falls Church, VA 22040-0747 
Telephone: (703) 205-8000 
or 

Customer No. 2292 

13. □ An extension of time for (_J_ month (s) until 

has been submitted in parent Application No. 08/952,089 
in order to establish co-pendency with the present 
application . 

14. ^ Also enclosed herewith is the following: 

Copies of receipts for deposits of microorganisms. 



3 



Docket No, 1422-0442P 



If necessary, the Commissioner is hereby authorized in this, 
concurrent, and future replies, to charge payment or credit any 
overpayment to Deposit Account No. 02-2448 for any additional 
fees required under 37 C.F.R. § 1.16 or under 37 C.F.R. § 1.17; 
particularly, extension of time fees. 



Respectfully submitted. 



BIRCH, STEWART, KOLASCH & BIRCH, LLP 
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P.O. Box 747 

Falls Church, VA 22040-0747 
(703) 205-8000 



Attachments 



(Re 



09/29/2000) 
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IN THE U.S. PATENT AND TRADEMARK OFFICE 



Applicant : 



Hiroaki SAGAWA et al. 



Appl . No , : 



Divisional of 08/952,089 Group: 



Unassigned 



Filed: 



October 18, 2000 



Examiner: Unassigned 



For : 



PLASMID 



PRELIMINARY AMENDMENT 



Assistant Commissioner for Patents 



October 18, 2000 



Washington, DC 20231 
Sir: 

The following preliminary amendments and remarks are 
respectfully submitted in connection with the above-identified 
application . 



IN THE SPECIFICATION : 
Page 19 

Line 22, please insert — pMS434 contains in part DNA having the 

DNA sequence identified as SEQ ID NO: 31. pMSP6L contains in part 
DNA having the DNA sequence identified as SEQ ID NO: 32. 

Line 24, please insert — pMS434 contains in part DNA having the 

DNA sequence identified as SEQ ID NO: 31. pMSP6L contains in part 



AMENDMENTS 



Docket No. 1422-0442P 

DNA having the DNA sequence identified as SEQ ID NO: 32. pMSP60 
contains in part DNA having the DNA sequence identified as SEQ ID 
NO:33. -- 

Please replace pages 82-100 of the specification with the 
Substitute Sequence Listing enclosed herewith. Please renumber the 
remaining pages, beginning with the claims, consecutively from page 
98 of the Sequence Listing. 

IN THE CLAIMS : 

Please cancel claim 1-19 without prejudice or disclaimer of 
the subject matter contained therein. 

Please amend the claims as follows: 

Claim 20, A method for isolating a desired gene, characterized in 
that [the] a plasmid vector comprising a promoter sequence to 
control an expression of a desired gene, said promoter sequence 
being recognized by an RNA polymerase not inherent to a host, and 
a replication origin for increasing a copy number by induction with 
an exogenous factor. 

Claim 28, line 2, please change "any one of claims 25 to 27" to 
— claim 25 — . 
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REMARKS 

Enclosed herewith in full compliance with 37 C.F.R. 1,821-1.825 
is a Substitute Sequence Listing to be inserted into the 
specification as indicated above. The Substitute Sequence Listing 
in no way introduces new matter into the specification. 

In further compliance with 37 C.F.R. 1.821-1.825 please 
transfer the disk copy of the Substitute Sequence Listing named file 
1422-319 . sub, filed on July 17, 1998 in the parent application to 
the present application. The disk copy of the Substitute Sequence 
Listing, file 1422-319 . sub, is identical to the paper copy submitted 
herewith, except that it lacks formatting. 

Entry of the above amendments is earnestly solicited. An early 
and favorable first action on the merits is earnestly solicited. 

Should there be any outstanding matters that need to be 
resolved in the present application, the Examiner is respectfully 
requested to contact MaryAnne Armstrong, Ph.D. (Reg. 4 0,069) at the 
telephone number of the undersigned below. 
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If necessary, the Commissioner is hereby authorized in this, 
concurrent, and future replies, to charge payment or credit any 
overpayment to Deposit Account No. 02-24 4 8 for any additional fees 
required under 37 C.F.R. § 1.16 or under 37 C.F.R. § 1.17; 
particularly, extension of time fees. 



Respectfully submitted, 

BIRCH, STEWART, KOLASCH & BIRCH, LLP 




Marc S. Weiner, #32,181 
P.O. Box 747 

MSW/MAA/csp Falls Church, VA 22040-0747 

1422-0442P (703) 205-8000 

(Rev. 04/19/2000) 
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DESCRIPTION 
PLASMID 

5 TECHNICAL FIELD 

The present invention relates to a plasmid vector 
usable in genetic recombinant technology and to a method 
for expressing a gene by using the plasmid vector. The 
present invention also relates to a method for isolating a 

10 desired gene by using such a plasmid vector. In addition, 
the present invention relates to a restriction enzyme and 
a gene thereof available as a genetic engineering reagent, 
in more detail to the AccIII restriction endonuclease and 
a DNA coding therefor. 

15 BACKGROUND ART 

In constructing an expression system for a desired 
gene by genetic recombinant technology, expression of the 
gene is con-trolled by bringing it under control of a 
promoter recognized by the RNA polymerase of the host 

20 used. In the case of a gene encoding a protein harmful to 
the host, however, plasmid construction itself is 
sometimes hampered by expression of the product of the 
gene due to the inability to stringently control the 
expression of the promoter used. 

25 As an expression system resolving that problem, the 
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pET system (produced by Novagen) has been developed, which 
uses -the RNA polymerase of the bacteriophage T7, which 
infects Escherichia coll, with Escherichia coli as a host 
[Journal of Molecular Biology^ Vol. 189, pp. 113-130 
5 (1986); Gene, Vol. 56, pp. 125-135 (1987)]. The pET 

system is a system that allows T7 RNA polymerase, which 
has high promoter recognition specificity and high 
transcription activity, to be expressed in Escherichia 
coli, which T7 RNA polymerase transcribes a desired gene 

10 placed downstream of the T7 promoter on an expression 

vector and causes high expression of the gene. Because 
transcription of the desired gene occurs in the presence 
of T7 RNA polymerase, plasmid construction in the host is 
possible without expressing -the desired gene, provided 

15 that the host does not produce the polymerase; plasmid 

construction itself is never hampered, as in cases where 
the expression system is constructed, while the desired 
gene is kept under control of a promoter recognized by the 
RNA polymerase of the host. 

20 However, because the T7 RNA polymerase gene has been 

cloned onto the X,-phage vector and lysogenized into the 
expression host, there is no freedom of host choice; 
painstaking procedures are needed if the host is changed. 
In addition, because the expression of T7 RNA polymerase 

25 in the host is not stringently controlled, T7 RNA 
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polymerase is expressed even when the host is in a 
non-inductive condition, resulting in expression of the 
desired gene placed downstream of the T7 promoter on the 
expression vector even in a non-inductive condition. To 
5 suppress such expression of the desired gene in a 

non-inductive condition, T7 RNA polymerase activity is 
inhibited using T7 lysozyme, a T7 RNA polymerase inhibitor 
[Journal of Molecular Biology, Vol. 219, pp. 37-44 
(1991)], or T7 RNA polymerase is prevented from getting 

10 access to the T7 promoter by placing a lactose operator 
downstream of the T7 promoter [Journal of Molecular 
Biology, Vol. 219, pp. 45-59 (1991)]. 

However, even these countermeasures are 
unsatisfactory in terms of effect against T7 RNA 

15 polymerase of high transcription activity so that the 

activity of T7 RNA polymerase in a non-inductive condition 
cannot be completely inhibited. For this reason, if the 
desired gene product is lethal to the host, it is 
impossible in some cases to prepare a transformant for 

20 expression of the gene, even when plasmid construction is 
possible. In other words, the pET system involves two 
problems to be resolved: one of the inability to freely 
change the host, and the other of inaccurate control of T7 
RNA polymerase expression. 

25 On the other hand, there is a bacteriophage having 
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characteris-tics similar to those of the bacteriophage T7, 
known as the bacteriophage SP6 [Science, Vol. 133, pp. 
2069-2070 (1961)3, which infects Salmonella typhimurium. 
The RNA polymerase produced by the bacteriophage SP6, a 
5 single peptide having a molecular weight of about 100,000, 
is commonly used for in vitro RNA synthesis since it 
possesses high promoter recognition specificity and high 
transcription activity [Journal of Biological Chemistry, 
Vol. 257, pp. 5772-5778 (1982); Journal of Biological 

10 Chemistry, Vol. 257, pp. 5779-5788 (1982)]. In addition, 
the SP6 RNA polymerase gene has already been cloned and 
expressed in large amounts in Escherichia coll [Nucleic 
Acids Research, Vol. 15, pp. 2653-2664 (1987)]. 

Genes whose expression product acts lethally on hosts 

15 are exemplified by restriction endonuclease genes. 

Essentially, restriction endonucleases are utilized for 
self-defence by cleaving phages and other exogenous DNA 
entering the cells of microorganisms that produce the 
restriction endonucleases. On the other hand, 

20 microorganisms that produce restriction endonucleases 
mostly produce modification enzymes that recognize the 
same base sequences as those of the restriction 
endonucleases, to protect their own DNA against cleavage 
by the restriction endonucleases. Specifically, a 

25 modification enzyme modifies DNA by adding a methyl group 
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to one or more bases in the base sequence recognized 
thereby, to make it impossible for the restriction 
endonuclease that recognizes the same sequence as that of 
the modification enzyme to bind thereto or to cleave the 
5 DNA. This mechanism is called restriction raodificatioh 
system, and the pair of genes of the restriction 
endonuclease and modification enzyme that constitute the 
restriction modification system called restriction 
modification system gene. Therefore, when the restriction 

10 endonuclease gene is expressed in a microorganism lacking 
a modification enzyme gene from the restriction 
modification system gene, the microorganism's DNA is 
cleaved, resulting in cell death. In fact, there are two 
modification enzyme genes in the Mbol restriction 

15 modification system gene; it has been reported that 

cloning of restriction endonuclease genes is impossible 
due to incomplete modification of the host DNA in the case 
of incomplete methylation in the co-presence of either 
modification enzyme gene alone [Nucleic Acids Research, 

20 Vol. 21, pp. 2309-2313 (1993)]. 

Also, it has been demonstrated that if a restriction 
modification system gene is lost from a cell retaining the 
restriction modification system gene, a lack of 
modification activity in the cell results in incomplete 

25 methylation of genomic DNA, which in turn causes lethal 



cleavage of its own genomic DNA by a very small amount of 
restriction endonuclease remaining therein [Science, 
Vol. 267, pp. 897-899 (1995)]. In summary, in the absence 
of modification enzymes that constitute a restriction 
5 modification system, restriction endonucleases behave as 
proteins very harmful to cells; separate cloning and 
expression of their genes have been impossible by prior 
art technologies . 

Concerning restriction endonucleases, restriction 

10 endonucleases can be classified by their enzymatic 

properties into three types: I, II and III. Type II 
restriction endonucleases, in particular, each of which 
recognizes a particular DNA base sequence and cleaves it 
at a particular site in or near the sequence, are 

15 extensively used in the field of genetic engineering, and 
restriction endonucleases of this type with various 
specificities have been isolated from a variety of 
microorganisms [Nucleic Acids Research, Vol. 24, pp. 
223-235 (1996)]. In the present specification, a type II 

20 restriction endonuclease is hereinafter referred to as 

"restriction endonuclease". It should be noted, however, 
that some microorganisms produce only small amounts of 
restriction endonuclease, and others produce a plurality 
of restriction endonucleases. For example, the 

25 restriction endonuclease AccIII is produced by 
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Acinetobacter calcoacetlcus (hereinafter referred to as 
Acc bacterium), which has been deposited under accession 
number PERM BP-935 at the National Institute of Bioscience 
and Human-Technology, Agency of Industrial Science and 
5 Technology, Ministry of International Trade and Industry 
[address: 1-3, Higashi 1-chome, Yatabemachi, Tsukuba-gun, 
Ibaraki, 305, Japan] since November 9, 1985 (date of 
original deposition), but the amount of the enzyme 
produced is small and this microorganism also produces the 

10 restriction endonucleases AccI and AccII simultaneously. 
Therefore, advanced production technology is needed to 
provide the restriction endonuclease AccI I I as a reagent 
of high purity and low cost using this microorganism. In 
providing a restriction endonuclease as a reagent of high 

15 purity and low cost, it is effective to isolate the 

desired restriction endonuclease gene and selectively 
produce the desired restriction endonuclease in large 
amounts by genetic engineering technology. To accomplish 
this purpose, some methods of isolating restriction 

20 endonuclease genes have been reported. 

First, there may be mentioned the "shotgun" method, 
wherein the genomic DNA of a microorganism that produces a 
restriction endonuclease is cleaved using the appropriate 
restriction endonuclease, the resulting fragment is 

25 inserted into an appropriate plasmid vector, and a clone 
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expressing the res-fcriction endonuclease gene is selected- 
Screening methods for desired clones are exemplified by a 
method wherein a restriction modification system gene is 
isolated with resistance to phage infection as an index, 
5 on the basis of the self-defence function acquired by the 
host upon introduction of the restriction modification 
system gene thereinto [PstI: Proceedings of the National 
Academy of Science of the USA, Vol. 78, pp. 1503-1507 
(1981)]. This method, however, necessitates that the size 

10 of the restriction modification system gene falls within a 
range allowing its isolation, and that the expression of 
the restriction modification system gene isolated exhibits 
sufficient phage resistance to allow the selective 
survival of the host. On the other hand, as a general 

15 feature of restriction modification system genes, there 
may be mentioned the close location on the genome of 
restriction endonuclease genes and modification enzyme 
genes; in fact, this has been confirmed in many 
restriction modification system genes that have so far 

20 been obtained [Nucleic Acids Research, Vol. 19, pp. 

2539-2566 (1991)]. Accordingly, there is a method wherein 
a restriction modification system gene is screened for 
with the expression of a modification enzyme gene as an 
index on the basis of the above-described feature 

25 [Japanese Patent Laid-Open No. 63-87982; Nucleic Acids 
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Research, Vol. 19, pp. 1831-1835 (1991)]. When the 
restriction endonuclease gene is not close to the 
modification enzyme gene, however, this method fails to 
yield the restriction endonuclease gene. 
5 Furthermore, the above-described "shotgun" method ■ 

poses a fundamental problem associated with a difference 
in transcription-translation mechanism between the genomic 
DNA source organism and the host. For example, in the 
case of insufficient gene expression due to the failure of 

10 the promoter and ribosome binding site accompanying the 
restriction modification system gene to function well in 
the host, much labor is needed to select transf ormants 
containing the desired gene, even if obtained. To avoid 
this drawback, there is a method wherein the amino acid 

15 sequence of the restriction endonuclease protein is 

analyzed, the restriction endonuclease gene is obtained 
from the genomic DNA of a microorganism that produces the 
restriction endonuclease by PCR-based DNA amplification on 
the basis of the sequence data obtained, and wherein a 

20 known protein expression system is utilized [Japanese 

Patent Laid-open No. 6-277070] . Because the presence of a 
restriction endonuclease is lethal to the host in 
conventional protein expression systems, there is a need 
to protect the host by, for example, allowing a 

25 modification enzyme that constitutes a restriction 



- 10 - 



modification system together with the enzyme to be 
co-present - 

Although all the above-described methods o£ the 
isolation of restriction endonuclease genes necessitate 
the slmul-taneous Isolation of the restriction endonuclease 
gene and a modification enzyme gene that constitutes a 
restriction modification system gene together with the 
gene, another method of isolating the restriction 
endonuclease gene alone has been reported [Nucleic Acids 
Research, Vol. 22, pp. 2399-2403 (1994)]. In that method, 
however, it is intended to isolate a gene encoding a 
restriction endonuclease for which optimal temperature for 
enzyme activity is around 70 "C; the co-presence of a 
modification enzyme gene is necessary when the gene to be 
isolated encodes a restriction endonuclease showing high 
specific activity near host culturing temperature. 

Exceptionally, there are ' restriction endonucleases 
that do not show cleavage activity unless a particular 
nucleic acid base in their DNA recognition sequence has 
not been modified by methylation, like the restriction 
endonuclease Dpnl . Genes for restriction endonucleases 
possessing this property are thought to be exceptional in 
that they can be isolated even in the absence of another 
particular gene by selecting the appropriate host 
organism. In fact, the mrr gene has been isolated, which 
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encodes the Mrr protein, which is not a type II 
restriction endonuclease but which recognizes a particular 
DNA base sequence containing a methylated nucleic acid 
base and exhibits DNA cleavage activity [Journal of 
Bacteriology, Vol. 173, pp. 5207-5219 (1991)]. 

As stated above, isolation of a restriction 
endonuclease gene by the prior art necessitates the 
simultaneous expression of the gene and a modification 
enzyme gene that constitutes a restriction modification 
system gene together with the gene, except for special 
cases . 

DISCLOSURE OF THE INVENTION 

Therefore, a first object of the present invention 
provides a plasmid vector capable of isolating such a gene 
that an isolation thereof or a construction of an 
expression system thereof has '^been difficult in the prior 
arts because the gene product is lethal or harmful to a 
host, and capable of introducing into a host to express 
the protein efficiently. A second object of the present 
invention provides a method for expressing a desired gene 
by using this plasmid vector. A third object of the 
present invention provides a method for isolating a 
desired gene by using this plasmid vector, especially for 
isolating a restriction endonuclease gene without co- 
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existence of a modif ication enzyme gene constitu-ting a 
restriction modification system. A fourth object of the 
present invention provides a polypeptide possessing an 
activity of an AccIII restriction endonuclease . In 
addition, a fifth object of the present invention provides 
a DNA which encodes a polypeptide possessing an activity 
of an AccIII restriction endonuclease. 

First, to resolve the problems in the pET system, the 
present inventors have constructed an expression system 
using SP6 RNA polymerase as a new accurate expression 
control expression system, and assessed the system. 

Because the expression system is constructed using a 
system plasmid inserted the SP6 RNA polymerase gene into 
the miniF plasmid, expression systems for the desired gene 
can be constructed in various strains of Escherichia coll 
by simultaneously introducing an expression vector 
harboring the desired gene cloned downstream of the SP6 
promoter and this system plasmid into the host. In 
addition, using the lac promoter and antisense technology, 
the present inventors made it possible to exactly control 
the expression of the SP6 RNA polymerase gene on the 
system plasmid. Assessing this expression system using 
the p-galactosidase gene as a reporter gene demonstrated 
that the expression of the SP6 RNA polymerase gene is 
accurately controlled in this system, that there is almost 
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no expression of SP6 RNA polymerase in a non-inductive 
condition, and that induction is followed by the 
expression of a sufficient amount of SP5 RNA polymerase to 
efficiently express the desired gene and subsequent 
5 expression of the desired gene at a high level. 

It was also shown, however, that when the host used 
is Escherichia coll, the desired gene downstream of the 
SP6 promoter is expressed in very small amounts even in a 
non-inductive condition because the SP6 promoter is very 

10 weakly but actually recognized by Escherichia coll RNA 

polymerase. It was thus proven that when the gene product 
acts very harmfully and lethally to Escherichia coll, 
expression system construction is impossible so that the 
object of the present invention cannot be accomplished 

15 well. 

With these findings in mind, the present inventors 
have made further extensive investigation and unexpectedly 
found that (i) expression of the desired gene in a 
non-inductive condition can be suppressed to undetectable 

20 levels, and that (ii) expression induction increases the 

copy number of the plasmid containing the desired gene and 
causes RNA polymerase expression, resulting in the 
transcription and translation of the desired gene placed 
downstream of a promoter recognized by the RNA polymerase, 

25 by using a new system for controlling the expression of 
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the desired gene by means of a combination of two control 
methods, i.e., control of the copy number of the gene, and 
transcription control via the promoter. In other words, 
the plasmid vector of the present invention is a plasmid 
vector having unique features to resolve the above 
problems in the field of genetic engineering. The 
present inventors made further investigation based on this 
finding, and developed a method for expressing the desired 
gene using the vector. 

The present inventors also have developed a method 
for isolating a gene whose expression product acts 
lethally on the host, using the above-described plasmid 
vector, more specifically a method of isolating a 
restriction endonuclease gene without the drawbacks of the 
prior art. 

The present inventors also found it possible to 
isolate DNA encoding the AccIII restriction endonuclease, 
which had not been obtained so far, without the 
co-presence of an AccIII modification enzyme, and express 
the AccIII restriction endonuclease in large amounts, 
using the above-described method. 

(1) The gist of the present invention is concerned with: 
A plasmid vector characterized by comprising a promoter 
sequence to control an expression of a desired gene, the 
promoter sequence being recognized by an RNA polymerase 
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not inherent to a host, and a replication origin for 
increasing a copy number by induction with an exogenous 
factor; 

(2) The plasraid vector described in item (1) above, 
wherein the promoter sequence is recognized by RNA 
polymerases derived from bacteriophages; 

(3) The plasmid vector described in item (2) above, 
wherein the promoter sequence is recognized by an RNA 
polymerase derived from SP6 phage; 

(4) The plasmid vector described in item (3) above, 
wherein the promoter sequence contains the base sequence 
of SEQ ID NO: 30 in the Sequence Listing; 

( 5 ) The plasmid vector described in any one of items ( 1 ) 
to (4) above, wherein the replication origin is under 
control of a promoter; 

( 6 ) The plasmid vector described in any one of items ( 1 ) 
to (5) above, wherein the replication origin is under 
control of the lac promoter; 

(7) The plasmid vector described in any one of items (1) 
to (6) above, comprising a drug resistance gene as a 
selection marker; 

(8) The plasmid vector described in item (7) above, which 
is selected from pACE601, pACE611, pACE701 and pACE702; 

( 9 ) A plasmid vector in which a desired gene to be 
expressed is incorporated into the plasmid vector 
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described in any one of items (1) to (8) above; 

(10) A method for expressing a desired gene, characterized 
by introducing into a host a plasmid vector in which the 

5 desired gene is incorporated into the plasmid vector 

described in any one of items (1) to (8) above, and an RNA 
polymerase gene which recognizes a promoter sequence in 
the plasmid vector, and inducing an increase in a copy 
number of the plasmid vector and an expression of the RNA 
10 polymerase by using an exogenous factor to transcribe and 
translate the desired gene; 

(11) The method for expressing a desired gene described in 
item (10) above, characterized in that the increase in the 
copy number of the plasmid vector and the expression of 

15 the RNA polymerase are induced by respective exogenous 
factors; 

(12) The method for expressing a desired gene described in 
item (10) above, characterized in that the increase in the 
copy number of the plasmid vector and the expression of 

20 the RNA polymerase are induced by a same exogenous factor; 

(13) The method for expressing a desired gene described in 
any one of items (10) to (12) above, wherein the exogenous 
factor which induces the increase in the copy number of 
the plasmid vector, is one or more selected from the group 

25 consisting of an addition of isopropyl-p-D-thiogalactoside 
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(IPTG), an addition of lactose, an addition of galactose, 
an addition of arabinose, a reduction of a tryptophane 
concentration and an adjustment of a transf ormant 
cultivation temperature; 

(14) The method for expressing the desired gene described 
in any one of items (10) to (12) above, wherein the 
exogenous factor which induces the expression of the RNA 
polymerase, is one or more selected from the group 
consisting of an addition of isopropyl-p-D-thiogalactoside 
(IPTG), an addition of lactose, an addition of galactose, 
an addition of arabinose, a reduction of a tryptophane 
concentration and an adjustment of a transf ormant 
cultivation temperature; 

(15) The method for expressing a desired gene described in 
item (10) above, characterized in that the RNA polymerase 
gene is introduced into the host by the other plasmid 
vector or a phage vector; 

(16) The method for expressing a desired gene described in 
item (10) above, characterized in that the RNA polymerase 
gene is incorporated into a chromosome of the host; 

(17) The method for expressing a desired gene described in 
item (15) or item (16) above, characterized in that the 
RNA polymerase gene is derived from SP6 phage; 

(18) The method for expressing a desired gene described in 
any one of items (10) to (17) above, wherein the desired 
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gene encodes a protein lethal or harmful to the host; 

(19) The method for expressing a desired gene described in 
any one of items (10) to (18) above, characterized in that 
Escherichia coll is used as the host; 

(20) A method for isolating a desired gene, characterized 
in that the plasmid vector described in any one of items 
(1) to (8) above is employed in the method for isolating 
the desired gene; 

(21) The method for isolating a desired gene described in 
item (20) above, wherein the desired gene encodes a 
protein lethal or harmful to a host; 

(22) The method for isolating a desired gene described in 
item (21) above, wherein the gene encoding a protein 
lethal or harmful to the host is a restriction 
endonuclease gene; 

(23) A polypeptide containing the entire or a portion of 
the amino acid sequence shown by SEQ ID N0:1 in the 
Sequence Listing, and possessing an activity of AccIII 
restriction endonuclease; 

(24) A polypeptide having an amino acid sequence resulting 
from at least one of deletion, addition, insertion or 
substitution of one or more amino acid residues in the 
amino acid sequence Of SEQ ID N0:1 in the Sequence Listing 
or a portion thereof, and possessing an activity of AccIII 
restriction endonuclease; 
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(25) A DNA encoding a polypeptide which contains the 
entire or a portion of the amino acid sequence shown by 
SEQ ID N0:1 in the Sequence Listing, and possesses an 
activity of AccIII restriction endonuclease; 
5 (26) A DNA containing the entire or a portion of the DNA 
shown by SEQ ID NO: 2 in the Sequence Listing wherein an 
expression product of the DNA possesses an activity of 
AccIII restriction endonuclease; 

(27) A DNA encoding a polypeptide resulting from at least 
10 one of deletion, addition, insertion or substitution of 

one or more amino acid residues in the amino acid sequence 
of SEQ ID N0:1 in the Sequence Listing or a portion 
thereof, and possessing an activity of AccIII restriction 
endonuclease; and 
15 (28) A DNA capable of hybridizing to the DNA described in 
any one of items (25) to (27) above, and encoding a 
polypeptide possessing an activity of AccIII restriction 
endonuclease . 



20 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the constitution of the model 
expression plasmids pMSP6L and pMSP6F. 

Figure 2 shows the constitution of the model 
expression plasmid pMSP60. 
25 Figures 3 (A), (B) show the procedure of the 
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construction of the runaway plasmid pHS2870. 

Figure 4 shows the constitution of the runaway 
plasmid pHS2870. 

Figures 5 (A), (B) show the procedure of the 
construction of the plasmid pCRS04. 

Figure 6 shows the constitution of the plasmid 
pCRS04. 

Figure 7 shows the constitution of the plasmid 
PCRA19. 

Figures 8 (A), (B), (C), (D), (E) show the procedure 
of the construction of the system plasmid pFSP6. 

Figures 9 (A), (B) show the procedure of the 
construction of the plasmid pACE601. 

Figure 10 shows the constitution of the plasmid 
PACE611. 

Figures 11 (A), (B) show the procedure of the 
construction of the plasmid pACE611. 

Figures 12 (A), (B) show the procedure of the 
construction of the plasmid pCRS70. 

Figures 13 (A), (B) show the procedure of the 
construction of the plasmids pACE701 and pACE702. 

Figure 14 shows the expression of the activity of Nsp 
7524 III restriction endonuclease depending on the 
induction or non-induction of IPTG, by agarose gel 
electrophoresis of the degradation reaction solution of 
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DNA after introducing pFSP6 and pACE601 or pFSP6 and 
pACE601-NspIII into Escherichia coll HBIOI. In this 
figure, M indicates the X,-EcoT141 size marker; 1 indicates 
Aval digested X-DNA; 2 indicates HB101/pFSP6/pACE601 after 
5 induction; 3 indicates HB101/pFSP6/pACE601 before 

induction; 4 indicates HB101/pFSP6/pACE601-NspIII after 
induction; and 5 indicates HB101/pFSP5/pACE501-NspIII 
before induction. 

Figure 15 shows a result of an agarose gel 

10 electrophoresis in the case of 2 and 3 hour-elongation of 
the degradation reaction of X-DNA depending on the 
induction or non-induction of IPTG after introducing pFSP6 
and pACE601 or pFSP6 and pACESOl-NspIII into Escherichia 
coli HBIOI. In this figure, M indicates the X-EcoT141 

15 size marker; 1 indicates Aval digested A,-DNA; 2 indicates 
HB101/pFSP6/pACE601 after 1 hour (induction); 3 indicates 
HB101/pFSP6/pACE601 after 2 hours (induction); 4 indicates 
HB101/pFSP6/pACE601 after 3 hours (induction); 5 indicates 
HB101/pFSP6/pACE601 after 1 hour ( non- induct ion ) ; 6 

20 indicates HB101/pFSP6/pACE601 after 2 hours (non- 
induction); 7 indicates HB101/pFSP6/pACE601 after 3 hours 
(non-induction); 8 indicates HBl01/pFSP6/pACE601-NspIII 
after 1 hour (induction); 9 indicates HB101/pFSP6/pACE601- 
NspIII after 2 hours (induction); 10 indicates 

25 HBl01/pFSP6/pACE601-NspIII after 3 hours (induction); 11 
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indicates HBl01/pFSP6/pACE601-NspIII after 1 hour (non- 
induction); 12 indicates HBl01/pFSP6/pACE601-NspIII after 
2 hours (non-induction); and 13 indicates 

HB101/pFSP6/pACE601-NspIII after 3 hours ( non- induct ion ) . 

Figure 16 shows the construction of the AccIII 
restriction modification system gene. 

BEST MODE FOR CARRYING OUT THE INVENTION 

The first aspect of the present invention relates to 
a plasmid vector. More specifically, it relates to a 
plasmid vector characterized in that the plasmid vector 
comprises a promoter sequence, which is recognized by an 
RNA polymerase not inherent to the host, to control the 
expression of the desired gene, and a replication origin 
for increasing the copy number by induction with an 
exogenous factor. The promoter sequence contained in the 
plasmid vector, which sequence controls the expression of 
the desired gene, may be any promoter sequence; promoter 
sequences recognized by particular RNA polymerases, e.g., 
those recognized by RNA polymerases derived from the T7, 
T3, SP6 and other bacteriophages, can be used, with 
preference given to those recognized by the RNA polymerase 
derived from the SP6 phage. The base sequence of the 
minimum region of the promoter recognized by the RNA 
polymerase derived from the SP6 phage, i.e., the SP6 
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promoter, is shown by SEQ ID NO: 30 in the Sequence 
Listing. Furthermore, the plasmid vector may have a drug 
resistance gene used as a selection marker. 

Exogenous factors that control the copy number of the 
plasmid vector of the present invention, i.e., the 
function of the replication origin of the plasmid vector, 
include, but are not limited to, for example, the addition 
of various chemicals, such as lactose or structurally 
related substances like isopropyl-p-D-thiogalactoside 
(IPTG), galactose or structurally related substances, and 
arabinose or structurally related substances; reduction of 
tryptophane concentration; and adjustment of trans formant 
cultivation temperature. By bringing the replication 
origin under control of a promoter that can be controlled 
by these factors, copy number control becomes possible. 
Promoters usable for this purpose include, but are not 
limited to, the lac, trp, tac, gal, ara and promoters 
etc. when the host used is Escherichia coll, as long as 
the above-described purpose is accomplished. These 
promoters can also be used to induce the expression of an 
RNA polymerase gene. 

Examples of the replication origin of the plasmid 
vector of the present invention include, but are not 
limited to, plasmid vectors that serve as replication 
origins of runaway plasmids under control of the lac 
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promoter. In this case, copy number control is achieved 
by the addition of lactose and structurally related 
substances, most preferably by the addition of 
isopropyl-p-D-thiogalactoside ( IPTG) . 
5 The second aspect of the present invention relates to 

a method for expressing a desired gene to be expressed, 
characterized by introducing a plasmid vector prepared by 
incorporating the desired gene into the plasmid vector of 
the present invention, and an RNA polymerase gene that 

10 recognizes the promoter sequence on the plasmid vector, 
into a host, and inducing an increase in the copy number 
of the plasmid vector and the expression of the RNA 
polymerase with an exogenous factor to transcribe and 
translate the desired gene. RNA polymerase genes that 

15 recognize the promoter sequence on the plasmid vector 
include, but are not limited to, for example, the RNA 
polymerase genes derived from the above-mentioned 
bacteriophages, with preference given to the RNA 
polymerase gene derived from the SP6 phage. Exogenous 

20 factors that induce the expression of such RNA polymerase 
or an increase in the copy number of the plasmid vector 
Include, but are not limited to, for example, the addition 
of various chemicals, such as lactose or structurally 
related substances like isopropyl-p-D-thiogalactoside 

25 (IPTG), galactose or structurally related substances, and 
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arabinose or structurally related substances; the 
reduction of tryptophane concentration; and the adjustment 
of transformant cultivation temperature. 

Induction of an increase in the copy number of 
plasmid vector and that of expression of RNA polymerase 
may be achieved by the action of respective exogenous 
factors or a same exogenous factor. Also, in this case, 
the gene encoding RNA polymerase may be incorporated onto 
a chromosome of the host, or introduced into the host by a 
plasmid vector other than the plasmid vector of the 
present invention or a phage vector- In the latter case, 
the host can readily be changed according to the purpose. 
In that case, introduction of the vector having an RNA 
polymerase gene into the host may precede or follow the 
introduction of the plasmid vector of the present 
invention. 

Although the desired gene in the present invention is 
not subject to limitation, it is meaningful when it 
encodes a protein lethal or harmful to the host. Such 
proteins include, for example, the above-mentioned 
restriction endonucleases, other nucleases, nucleic 
acid-binding proteins, and proteases. 

The present invention provides a system that controls 
the expression of a desired gene by a combination of two 
control methods, i.e., control of the copy number of the 
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gene, and control of transcription via a promoter, as 
described above. Control of the expression of the desired 
gene cloned onto a plasmid in the present invention, 
unlike conventional control, is very stringent so that its 
expression can be suppressed to an undetectable level in a 
non-inductive condition. When the copy number of the 
plasmid, i.e., the copy number of the desired gene, is 
increased, with concurrent expression of RNA polymerase, 
by expression induction, the desired gene is transcribed 
and translated under control of the promoter by the action 
of the RNA polymerase. 

It is therefore possible to use the plasmid vector of 
the present invention to prepare a transformant containing 
a gene harmful or lethal to the host and express the gene, 
a task difficult to achieve by the prior art. 

For example, when cloning of a gene encoding the 
Nsp7524 III restriction endonuclease was attempted using 
the plasmid vector of the present invention, the gene was 
successfully retained in Escherlcjfila coll without 
co-presence of the corresponding modification enzyme gene. 

It was also possible to express the Nsp7524 III 
restriction endonuclease in the cells of the resulting 
transformant by introducing a system plasmid containing an 
RNA polymerase gene into the transformant to induce the 
expression of the Nsp7524 III restriction endonuclease. 
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Also, using the plasmid vec1;or of "the present 
invention as a cloning vector for preparation of a gene 
library, it is possible to isolate a gene that cannot be 
isolated by a conventional method, and confirm the 
5 activity of its expression product in a single host. The 
use of the plasmid vector of the present invention is of 
course not limited to the cloning of a gene encoding a 
product harmful to the host, and can be used as a 
general-purpose plasmid vector. 

10 An RNA polymerase gene relating to the present 

invention can be introduced into a host using, for 
example, the plasmid pFSP6 and the phage M13sp6. Details 
of construction of the plasmid pFSP6 are shown in 
Reference Example (2). Escherichia coll HBlOl as 

15 transformed with the plasmid, designated Escherichia coll 
HB101/pFSP6, has been deposited under accession number 
FERM BP- 5742 at the National Institute of Bioscience and 
Human- Technology, Agency of Industrial Science and 
Technology, Ministry of International Trade and Industry 

20 [address: 1-3, Higashi 1 chome, Tsukuba-shi, Ibaraki-ken, 
305, Japan] since December 22, 1995 (date of original 
deposition). A method for the construction of the phage 
M13sp6 is described in Example 2 ( 6 ) . 

The present invention is hereinafter described in 

25 more detail with reference to a case using the plasmid 
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pFSP6. 

First described are the results of an experiment 
conducted using the constructed multicopy model expression 
plasmids pMSP6L, pMSP6F (Figure 1) and pMSP60 (Figure 2), 
all having a promoter sequence recognized by SP6 RNA 
polymerase and the p-galactosidase gene as a reporter 
gene, and all allowing the expression of the desired gene 
using the above-described system plasmid pFSP6 . 

Details of construction of these model expression 
plasmids are shown in Reference Example ( 3 ) . These 
plasmids incorporate the minimum region of the SP6 
promoter (pMSP6L), a inherent SP6 promoter region 
(pMSP6F), or both the minimum region of the SP6 promoter 
and the lac operator region (pMSP60), respectively, as an 
SP6 plasmid sequence. 

When the plasmid pMSP6L or pMSP6F alone was 
introduced into the Escherichia coli strains shown in 
Table 1, p-galactosidase activity was noted, despite the 
fact that the host Escherichia coli did not express SP6 
RNA polymerase, the amounts being higher than that 
obtained with the plasmid pMS434, which does not have the 
SP6 promoter, as shown in Table 2. In short, it can be 
conjectured that the SP6 promoter is recognized by the RNA 
polymerase derived from Escherichia coli, and that the 
p-galactosidase gene downstream thereof is transcribed and 
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translated. Also, when Escherichia coll MRi80, a strain 
that has a mutation in the pcnB gene and wherein the copy 
number of a plasmid having a replication origin derived 
from ColEl, like the above-described model of expression 
plasmids, is reduced by 1/6 to 1/13 of that of the parent 
Strain MRi7, was used as a host, the p-galactosidase 
activity was also decreased to 1/7 to 1/14 of that of 
MRi7. This suggests a correlation between the 
p-galactosidase gene expression level and the copy number 
of the plasmid introduced. 

Table 1 



Strain 


Genotype 


Origin / Reference 


MC4100 


F- araD139, A (argF-lac)U169. 
thlA, rpsLIBO. relAI, flbB5301, 
deoCI, ptsF25, rbsR 


Casadaban et al. 

J. Mol.Biol.104(1976) 

541-555. 


MRi7 


MC4100Arbs-7 


Lopi lato et ai. 

J. Bacteriol. 158(1984) 

665-673 


MR 180 


MR 17 pcnBSO 


Lopi lato et al. Mol. 
Gen. Genet. 205(1986) 
285-290. 



Table 2 





$— Galactosidase Activity 


Strain 


Plasmid 




pMSPBF 


PMSP6L 


PMS434 


MC4100 


202 


242 


30 


MRi7 


234 


287 


23 


MR 180 


17 


27 


4 
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Next, when the same model of expression plasmids were 
introduced into Escherichia coli together with the 
above-described system plasmid pFSP6 and p-galactosidase 
activity was determined under conditions such that the 
expression of the SP6 RNA polymerase gene was not induced, 
the p-galactosidase activity was at most as high as that 
obtained with Escherichia coli not containing the system 
plasmid, as shown in Table 3. This demonstrates that the 
expression of the SP6 RNA polymerase gene on the plasmid 
pFSP6 is almost completely suppressed in a non-inductive 
condition. On the other hand, when the expression of the 
SP6 RNA polymerase gene was induced by the addition of 
isopropyl-p-D-thiogalactoside (IPTG), p-galactosidase 
activity was increased by 18 to 32 times, demonstrating 
that the p-galactosidase gene can be expressed via the SP6 
promoter. When the host used was Escherichia coli MRiSO, 
in particular, p-galactosidase activity was increased to a 
level 25 to 32 times that obtained in a non-inductive 
condition, 4 hours after induction, though the activity in 
a non-inductive condition remained at a very low level. 
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Table 3 



Strain 


/3- Galactosidase Activity (Non- Induct ion/ Induct ion) 


pMSPSF 


pMSPSL 


PMS434 


Non- 
Induction 


1 nduc- 
tion 


Non- 
Induction 


Induc- 
tion 


Non- 
Induction 


Induc- 
tion 


MC4100(pFSP6) 


221 


4875 


234 


5256 


25 


19 


MRi7(pFSP6) 


220 


3958 


242 


4243 


25 


21 


MRi80(pFSP6) 


17 


425 


30 


970 


3 


ND 



In the plasmid pMSP60, which carries the lac operator 
sequence downstream of the SP6 promoter, the 
p-galactosidase activity under non-inductive conditions is 
lower than that in other model expression plasmids; 
however^ its expression cannot be suppressed completely. 
These results suggest that the expression of the desired 
gene is difficult to completely suppress in an expression 
system using a multicopy plasmid, and that controlling the 
copy number of the plasmid containing the desired gene is 
more effective in resolving this problem than controlling 
the expression of the RNA polymerase gene on the system 
plasmid. 

The present inventors thus made further investigation 
based on these results to explore a new expression vector, 
and developed the plasmid vector of the present invention. 
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Specifically, the present inventors constructed a 
runaway plasmid vector capable of reducing the expression 
level in a non-inductive condition to ensure sufficient 
expression after induction. First, in consideration of 
5 the fact that SP6 RNA polymerase expression in a system 
plasmid is induced by IPTG, a plasmid having a runaway 
replication origin allowing an increase in copy number by 
IPTG induction was constructed in accordance with the 
procedures shown in Figure 3. First, the plasmid 

10 pUC106AdP0 is constructed by removing the PvuII fragment 
containing the lac promoter and the operator region from 
the plasmid pUC106A, which is obtained by introducing the 
106 Ndel DNA fragment prepared from the two DNA strands 
whose base sequences are shown by SEQ ID NO: 3 and SEQ ID 

15 NO: 4 in the Sequence Listing into the Ndel site of the 
plasmid vector pUC19 (produced by Takara Shuzo). 

Next, the plasmid pHS2870 can be obtained by 
inserting the RNA1870 fragment obtained by PCR using the 
RNAIIA primer (base sequence shown by SEQ ID NO: 5 in the 

20 Sequence Listing) arranged near the RNAII region 

(replication origin of the plasmid) and the 1870 primer 
(base sequence shown by SEQ ID NO: 6 in the Sequence 
Listing) arranged at a terminus of the 106 Ndel sequence 
with the plasmid as a template, into the Xbal site of the 

25 plasmid pSTV28 (produced by Takara Shuzo). The plasmid 
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PHS2870 thus constructed is shown in Figure 4. Generally, 
the plasmid is present at a copy number of about 30 copies 
per Escherichia coll cell; however, this copy number is 
increased to several hundreds by the addition of IPTG. 

The Accl-Nspl fragment containing the P15A 
replication origin derived from the plasmid pSTV28 is then 
removed from the plasmid (plasmid pCRSOl ) , followed by 
further removal of the EcoRI-Xbal fragment containing an 
unnecessary restriction endonuclease site {plasmid 
pCRS02), after which a DNA fragment containing the lactose 
repressor (laclq) gene derived from the plasmid pMJR1560 
[Gene, Vol. 51, pp. 225-267 (1987)] is introduced to yield 
the plasmid pCRS04, A flow diagram of the construction of 
the plasmid pCRS04 from the plasmid pHS2870 is shown in 
Figure 5. The plasmid pCRS04 thus constructed is shown in 
Figure 6. The plasmid is a runaway plasmid induced by 
IPTG; generally, the plasmid is present at a copy number 
of 1 to 2 copies per Escherichia coll cell; however, this 
copy number is increased to several hundreds by the 
addition of IPTG. 

To control the expression of the desired gene 
inserted into the plasmid, the Psps-Oiac EX linker, a 
double-stranded oligonucleotide containing the SP6 
promoter sequence and lac operator sequence, may be 
introduced into the Nhel site of the plasmid to construct 
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the plasmid pACE601. The Psp6~0iac ^-^ linker is prepared 
from -the two DNA strands whose base sequences are shown by 
SEQ ID NO: 7 and SEQ ID NO: 8 in the Sequence Listing. The 
plasmid pACE601 is a runaway plasmid vector having the 
5 chloramphenicol resistance gene as a selection marker. ' 

Also^ by introducing the BspHI fragment containing 
the p-lactamase gene derived from the plasmid vector 
pUCllS (produced by Takara Shuzo ) into the Nhel site of 
the above-described plasmid pCRS04 (plasmid pCRS70), 

10 subsequently removing the NcoI-BsaAI fragment containing 

the chloramphenicol resistance gene, and replacing it with 
the above-described Psp6~Oiao linker, the plasmids pACEVGl 
and pACE702 can be constructed, which plasmids have the 
linker inserted in mutually opposite directions. These 

15 plasmids are runaway plasmid vectors having the ampicillin 
resistance gene as a selection marker. 

The potential of the thus-obtained plasmids pACE601, 
pACE701 and pACE702 used for control of the expression of 
the desired gene can be determined using the 

20 p-galactosidase gene as a reporter gene, as described 
above. It is possible to construct model expression 
plasmids by introducing a DNA fragment containing the 
p-galactosidase gene as amplified by PCR using the primers 
trpA-N-NcoI and lacZ-C-NcoI with the plasmid pMS434 [Gene, 

25 Vol. 57, pp. 89-99 (1987)] as a template, into the Ncol 
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site downstream of the SP6 promoter in each of the 
above-described three plasmids. The base sequences of the 
primers trpA-N-NcoI and lacZ-C-NcoI are shown by SEQ ID 
N0:9 and SEQ ID NO: 10 in the Sequence Listing, 
respectively. The plasmids thus obtained are designated 
PACE601Z, pACE701Z and pACE702Z, respectively. In 
BscJiericJiia coll transformed with these model expression 
plasmids, absolutely no p-galactosidase activity is 
detected, demonstrating very exact control of the 
expression of the p-galactosidase gene. 

Using the Nsp7524 III restriction endonuclease gene, 
it is possible to confirm that a gene whose expression 
product acts lethally on the host can be isolated and 
expressed using the plasmid of the present invention. The 
plasmid pBRN3 contains the Nsp7524 III restriction 
modification system gene. Escherichia coll MC1061 as 
transformed with the plasmid, designated Escherichia coll 
MCl061/pBRN3, has been deposited under accession number 
PERM BP-5741 at the National Institute of Bioscience and 
Human-Technology, Agency of Industrial Science and 
Technology, Ministry of International Trade and Industry 
[address: 1-3, Higashi 1 chome, Tsukuba-shi, Ibaraki-ken, 
305, Japan] since September 28, 1995 (date of original 
deposition). By a PCR-based DNA amplification reaction 
using a pair of the primers L-ORF and NspR-0RF3 with the 
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plasmid pBRN3 prepared from the transformant as a 
template, a DNA fragment containing the Nsp7524 III 
restriction endonuclease gene alone can be obtained. The 
base sequences of the primers L-ORF and NspR-0RF3 are 
5 shown by SEQ ID NO: 11 and SEQ ID NO: 12 in the Sequence 
Listing, respectively. By introducing into Escherichia 
coli HBlOl the plasmid pACE601-NspIII, which is obtained 
by inserting the resulting DNA fragment into the Ncol site 
downstream of the SP6 promoter of the above-described 

10 plasmid pACE601, the transformant Escherichia coli 

HB101/pACE601-NspIII can be obtained. The transformant 
allows the host to stably retain the Nsp7524 III 
restriction endonuclease gene despite the absence of the 
Nsp7524 III modification enzyme gene. 

15 Furthermore, the transformant Escherichia coli 

HB101/pFSP6/pACE601-NspIII can be prepared by introducing 
the above-described system plasmid pFSP6 into the 
transformant. By culturing the transformant and inducing 
expression by the addition of IPTG at the appropriate 

20 time, the possibility of production of the Nsp7524 III 

restriction endonuclease in the culture can be confirmed. 

The third aspect of the present invention provides a 
method for isolating a desired gene characterized by using 
the above-described plasmid vector. The isolation method 

25 of the present invention is suitably applied to the 
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130131:1011 of a gene whose expression product Is lethal or 
harmful against the host, especially a restriction 
endonuclease gene. By using the present method, it is 
possible to isolate a restriction endonuclease gene that 
5 has not been Isolated so far, e.g., the AccIII restriction 
endonuclease gene, even in the absence of the AccIII 
modification enzyme gene. Isolation of a restriction 
endonuclease gene can be achieved not only by the "shotgun 
method", wherein the genomic DNA of a microorganism that 

10 produces the desired enzyme is cleaved with the 

appropriate restriction endonuclease, and the resulting 
DNA fragment is inserted directly into, for example, the 
plasmid pACE611, but also by the use of a cassette library 
of the genomic DNA of a microorganism that produces the 

15 restriction endonuclease. By obtaining a gene by DNA 

amplification with the cassette library as a template, it 
is possible to express a gene prepared from a 
microorganism whose transcription mechanism differs from 
that of the host. The method of isolating a restriction 

20 endonuclease gene using a cassette library is hereinafter 
described with reference to an example involving the 
AccIII restriction endonuclease. 

A cassette library of the genomic DNA of a 
microorganism that produces the AccIII restriction 

25 endonuclease, i.e., an Acc bacterium, can be prepared as 
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follows: Genomic DNA is extracted from the cell culture 
of an Acc bacterium, and digested with the appropriate 
restriction endonuclease, after which the resulting DNA 
fragment is ligated to a cassette having a protruding end 
complementary to the fragment. Several similar cassette 
libraries are prepared using different restriction 
endonucleases for genomic DNA cleavage. These libraries 
are generically referred to as an Acc genomic cassette 
library. Next, the desired restriction endonuclease 
protein is purified from the Acc bacterium, its amino acid 
sequence is partially determined, and a primer is 
synthesized on the basis of the sequence. Using this 
primer and cassette primer, a PCR-based DNA amplification 
reaction is carried out with each cassette library as a 
template, to obtain a DNA fragment containing an Acclll 
restriction endonuclease gene fragment. The base sequence 
of the DNA fragment obtained is determined by, for 
example, direct sequencing of the PGR product, to 
determine the full-length base sequence of the AccIII 
restriction endonuclease gene. 

By designing a primer capable of amplifying the 
full-length sequence of the AccIII restriction 
endonuclease gene from the base sequence, and carrying out 
a PCR-based DNA amplification reaction using this primer 
with the genomic DNA of the Acc bacterium as a template, a 
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DNA fragment containing the full-length sequence of the 
AccIII restriction endonuclease gene is obtained. The DNA 
fragment obtained is inserted downstream of the SP6 
promoter of the plasmid pACE611 so that the codon frames 
5 are adjusted, and the resulting recombinant plasmid is, 

introduced into, for example, Escherichia coll JM109, to 
yield transf ormants. Transf ormants containing the AccIII 
restriction endonuclease gene in an expressible condition 
can be selected by culturing each transf ormant, inducing 

10 gene expression by the gene expression method of the 

present invention, and determining the AccIII restriction 
endonuclease activity in each culture obtained, as well as 
by drawing the restriction endonuclease map of the plasmid 
DNA harbored by each transf ormant . 

15 The AccIII restriction endonuclease gene thus 

obtained was actually inserted into the plasmid pACE611, 
and the resulting plasmid was designated pCRAlQ. The 
restriction endonuclease map of this plasmid is shown in 
Figure 7, wherein the bold solid line indicates the DNA 

20 fragment containing the AccIII restriction endonuclease 

gene. Escherichia coll JM1G9 as incorporating the plasmid 
pCRA19, designated Escherichia coll JM109/pCRA19, has been 
deposited under accession number FERM BP- 5743 at the 
National Institute of Bioscience and Human-Technology, 

25 Agency of Industrial Science and Technology, Ministry of 
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International Trade and Industry [address: 1-3, Higashi 1 
chorae, Tsukuba-shi, Ibaraki-ken, 305, Japan] since May 28, 
1996 (date of original deposition). The transformant 
stably retains the restriction endonuclease gene, despite 
5 the absence of the AccIII modification enzyme gene 

therein- By the above-described method, a restriction 
endonuclease gene can be isolated without the co-presence 
of a modification enzyme gene that constitutes a 
restriction modification system gene. 

10 The fourth aspect of the present invention provides a 

polypeptide possessing an activity of the AccIII 
restriction endonuclease. 

In the present specification, the term "AccIII 
restriction endonuclease" as a general term of 

15 polypeptides possessing an activity of the AccIII 

restriction endonuclease may be used in some cases. The 
AccIII restriction endonuclease of the present invention 
comprises the amino acid sequence described in SEQ ID N0:1 
in the Sequence Listing, for example. 

20 The AccIII restriction endonuclease of the present 

invention also includes, but not limited to the above, the 
polypeptide containing the entire or a portion of the 
amino acid sequence described in SEQ ID N0:1 in the 
Sequence Listing and possessing an activity of the AccIII 

25 restriction endonuclease. Furthermore, the polypeptide 
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having an amino acid sequence resulting from at least one 
of deletion, addition, insertion or substitution of one or 
more amino acid residues in the amino acid sequence of SEQ 
ID N0:1 or a portion thereof in the Sequence Listing and 
possessing an activity of the AccIII restriction 
endonuclease is included in the scope of the present 
invention. 

Generally, a naturally-occurring protein can undergo 
deletion, insertion, addition, substitution and other 
variations of amino acid residues in its amino acid 
sequence due to modifications, etc. of the protein in vivo 
or during purification, as well as those due to 
polymorphism and variation of the gene encoding it. 
Nevertheless, it is known that there are some such 
polypeptides which are substantially equivalent to 
variation- free proteins in terms of physiological and 
biological activity. Thus, those structurally different 
from the corresponding protein, but having no significant 
difference of function or activity from the protein is 
within the scope of the present invention. It is also the 
same when artificially introducing the above variations to 
the amino acid sequence of a protein and in this case, it 
is possible to produce more diverse variants. For 
example, the methionine residue at the N-terminus of a 
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protein expressed in Escherichia coli is reportedly often 
removed by the action of methionine aminopeptidase, but 
the removal is not completely done depending on the kinds 
of proteins, and some such expressed proteins have the 
methionine residue and others not. However, the presehce 
or absence of the methionine residue does not affect 
protein activity in most cases. It is also known that a 
polypeptide resulting from replacement of a particular 
cysteine residue with serine in the amino acid sequence of 
human interleukin 2 (IL-2) retains IL-2 activity [Science, 
224, 1431 (1984)] . 

In addition, in producing a protein by gene 
engineering, the desired protein is often expressed as a 
fused protein. For example, the N-terminal peptide chain 
derived from another protein is added to the N-terminus of 
the desired protein to enhance the expression of the 
desired protein, or purification of the desired protein is 
facilitated by adding an appropriate peptide chain to the 
N- or C-terminus of the desired protein, expressing the 
protein, and using a carrier showing affinity for the 
peptide chain added. Thus, even if the polypeptide has an 
amino acid sequence partially different from the AccIII 
restriction endonuclease of the present invention, it is 
within the scope of the present invention as long as it 
possesses essentially equivalent activity to the AccIII 



- 43 - 



restriction endonuclease of the present Invention. 

The AccIII restriction endonuclease can be obtained 
by, for example, culturing the above-described 
transformant Escherichia coli JM109/pCRA19, which contains 
5 the AccIII restriction endonuclease gene, adding the 
inducing agent IPTG at an appropriate time during its 
cultivation to increase the copy number of pCRA19, and 
subsequently infecting with the phage M13sp6. 

The AccIII restriction endonuclease can be harvested 

10 from the transformant culture by, for example, collecting 
cells from the culture, subsequently extracting the enzyme 
by ultrasonic disruption, ultracentrif ugation, etc., and 
then purifying the enzyme by a combination of nucleic acid 
removal, salting-out, affinity chromatography, gel 

15 filtration, ion exchange chromatography, etc. Because the 
above-described culture, which serves as the starting 
material for this purification, does not contain other 
restriction endonucleases, such as AccI and AccII, an 
enzyme preparation of desired purity can be obtained more 

20 easily than by conventional purification methods. 

The fifth aspect of the present invention provides a 
DNA encoding a polypeptide possessing an activity of the 
AccIII restriction endonuclease. 

The DNA of the present invention encoding a 

25 polypeptide possessing an activity of the AccIII 
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restriction endonuclease comprises a DNA encoding the 
amino acid sequence described in SEQ ID NO:l in the 
Sequence Listing, and includes but not limited to, a DNA 
comprising the base sequence described in SEQ ID NO: 2 in 
5 the Sequence Listing, for example. Specifically, the 
following DNAs are within the scope of the present 
invention. 

( 1 ) a DNA encoding a polypeptide which contains the 
entire or a portion of the amino acid sequence described 

10 in SEQ ID N0:1 in the Sequence Listing and possesses an 
activity of the AccIII restriction endonuclease; 

( 2 ) a DNA containing the entire or a portion of the DNA 
shown in SEQ ID NO: 2 in the Sequence Listing, wherein the 
expression product of the DNA possesses an activity of the 

15 AccIII restriction endonuclease; 

(3) a DNA encoding a polypeptide resulting from at least 
one of deletion, addition, insertion or substitution of 
one or more amino acid residues in the amino acid sequence 
of SEQ ID N0:1 in the Sequence Listing or a portion 

20 thereof and possessing an activity of the AccIII 
restriction endonuclease; and 

(4) a DNA capable of hybridizing to the DNA described in 
above (1) to (3), and encoding a polypeptide possessing an 
activity of the AccIII restriction endonuclease, etc. . 

25 In addition, if the hybridization is carried out 
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under the stringent condition using the above obtained DNA 
as a probe, a similar DNA somewhat different from the 
obtained DNA (SEQ ID NO: 2 in the Sequence Listing) but 
encoding a polypeptide possessing the same enzyme 
5 activity, can be obtained. Such a DNA is also includeci in 
the scope of the present invention. 

Such a stringent condition refers to that the 
membrane with DNA immobilized thereon is subjected to 

10 hybridization with the probe in a solution containing 6 x 
SSC (1 X SSC is a solution of 8.76 g of NaCl and 4,41 g of 
sodium citrate in 1 liter of water), 1% SOS, 100 pg/ml 
salmon sperm DNA, 0.1% bovine serum albumin, 0.1% 
polyvinylpyrrolidone and 0.1% Ficoll, incubating at 65 "C 

15 for 20 hours, for example. 

Methods for obtaining similar DNA encoding the AccIII 
restriction endonuclease by hybridization include, for 
example, the following method. 

First, DNA obtained from an appropriate gene source 

20 is ligated to a plasmid or a phage vector by a 

conventional method to yield a DNA library. This library 
is introduced into an appropriate host; the resulting 
transformants are cultured on plates; colonies or plaques 
that have grown are transferred onto nitrocellulose or 

25 nylon membranes and denatured, after which the DNA is 
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fixed onto the membrane. These membranes are incubated 
for hybridization in a solution of the above-described 
composition containing a probe previously labeled with ^^P 
etc. (probe used may be any polynucleotide encoding the 
5 entire or a portion of the amino acid sequence shown by 
SEQ ID N0:1 in the Sequence Listing, exemplified by a 
polynucleotide consisting of, or containing, the entire or 
a portion of the base sequence shown by SEQ ID NO: 2 in the 
Sequence Listing) under the conditions shown above. After 

10 completion of the hybridization, the non-specif ically 

adsorbed probe is washed out, followed by autoradiography 
etc., to identify clones that have hybridized to the 
probe. This procedure is repeated until the desired 
hybridizing clone is isolated. The clone thus obtained 

15 retains DNA encoding a polypeptide having the desired 
enzyme activity. 

The DNA obtained is determined for base sequence to 
confirm that it encodes the desired enzyme protein, by, 
for example, the method described below. 

20 For base sequencing, the transformant is cultured in 

a test tube etc. when the host is Escherichia coll, and a 
plasmid is prepared by a conventional method, provided 
that the transformant has been prepared using a plasmid 
vector. Using the plasmid obtained as a template as is, 

25 or after the insert is taken out and subcloned into the 
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M13 phage vector etc., the base sequence is determined by 
the dideoxy method. In the case of a transformant 
prepared using a phage vector as well, the base sequence 
can be determined by basically the same procedures. 
5 These basic experimental processes from cultivation 

to base sequencing are described in, for example. 
Molecular Cloning: A Laboratory Manual, 1982, T. Maniatis 
et al., published by Cold Spring Harbor Laboratory. 
Whether or not the DNA obtained is similar DNA 

10 encoding the desired AccIII restriction endonuclease can 
be confirmed by comparing the determined base sequence 
with the base sequence shown by SEQ ID NO: 2 in the 
Sequence Listing, or by comparing the amino acid sequence 
deduced from the determined base sequence with the amino 

15 acid sequence shown by SEQ ID N0:1 in the Sequence 
Listing. 

When the DNA obtained does not contain the entire 
portion of the region encoding the desired restriction 
endonuclease, the entire encoding region can be obtained 
20 by synthesizing a primer on the basis of the base sequence 
of the DNA obtained, and amplifying the lacking region by 
PGR using the primer, or repeating screening the DNA 
library using the DNA fragment obtained as a probe. 

It is possible to prepare a transformant containing 
25 the thus-obtained similar DNA encoding the AccIII 



- 48 - 



restriction endonuclease, to allow the transf ormant to 
express the enzyme protein encoded by the DNA, and to 
purify the enzyme protein expressed. Preparation of the 
transformant and expression and purification of the enzyme 
protein can be all achieved using the plasmid of the ■ 
present invention. The enzyme protein thus obtained 
retains AccIII restriction endonuclease activity. 

Additionally, the AccIII modification enzyme and DNA 
encoding the enzyme, both of which have not been obtained 
so far, can also be obtained using the above-described DNA 
encoding the AccIII restriction endonuclease. 

For example, on the basis of the mutually close 
location of a restriction endonuclease gene and a 
modification enzyme gene in many cases, this purpose can 
be accomplished by obtaining a gene region encoding a 
protein near the restriction endonuclease gene by a DNA 
amplification reaction with a cassette library as a 
template, inserting it into an appropriate expression 
vector to allow the gene to be expressed, and confirming 
AccIII modification enzyme activity by an appropriate 
method. AccIII modification enzyme activity can also be 
confirmed, for example, on the basis of the resistance of 
the DNA prepared from the transformant to the cleavage 
activity of the AccIII restriction endonuclease. Provided 
that the base sequence of the above-described gene region 
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is determined to confirm homology to a conserved region 
between modification enzyme genes of a known restriction 
modification system [Journal of Molecular Biology, Vol. 
206, pp. 305-321 (1989)], it can be anticipated to some 
extent before gene expression that the gene is a 
modification enzyme gene. 

By using a cassette library of the genomic dna of the 
Acc bacterium as described above, the AccIII modification 
enzyme and DNA encoding it can be obtained. Its amino 
acid sequence and base sequence are shown by SEQ ID NO: 13 
and SEQ ID NO: 14 in the Sequence Listing, respectively. 

AccIII modification enzyme herein is not limited to 
the above described. As stated in the description of 
AccIII restriction endonuclease, it is a polypeptide 
containing the entire or a portion of the amino acid 
sequence described in SEQ ID No: 13 in the Sequence Listing 
and possessing the AccIII modification enzyme activity. 
Furthermore, the polypeptide resulting from at least one 
of deletion, addition, insertion or substitution of one or 
more amino acid residues in the amino acid sequence of SEQ 
ID NO: 13 or a portion thereof in the Sequence Listing and 
possessing the AccIII modification enzyme activity is also 
included in the scope of the present invention. 

A DNA encoding AccIII modification enzyme in the 
present invention herein comprises a DNA encoding the 
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amino acid sequence described in the SEQ ID NO: 13 in the 
Sequence Listing, and includes, but not limited to, a DNA 
comprising the base sequence described in the SEQ ID NO: 14 
in the Sequence Listing, for example. Specifically, the 
following DNAs are within the scope of the present 
invention. 

{ 1 ) a DNA encoding a polypeptide containing the entire or 
a portion of the amino acid sequence described in SEQ ID 
NO: 13 in the Sequence Listing and possessing the AccIII 
modification enzyme activity; 

(2) a DNA containing the entire or a portion of the DNA 
shown in SEQ ID NO: 14 in the Sequence Listing, wherein the 
expression product of the DNA possesses the AccIII 
modification enzyme activity; 

(3) a DNA encoding a polypeptide resulting from at least 
one of deletion, addition, insertion or substitution of 
one or more amino acid residues in the amino acid sequence 
of SEQ ID NO: 13 or a portion thereof in the Sequence 
Listing and possessing the AccIII modification enzyme 
activity; and 

(4) a DNA capable of hybridizing to the DNA described in 
above (1) to (3), and encoding a polypeptide possessing 
the AccIII modification enzyme activity, etc. . 

The present invention is hereinafter described in 
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more detail by means of the following reference example 
and working examples, which examples are not to be 
construed as limitative. Of the procedures described 
herein, basic ones regarding plasmid preparation, 
restriction endonuclease digestion, etc. were achieved' in 
accordance with the methods described in Molecular 
Cloning: A Laboratory Manual, 2nd edition, edited by T. 
Maniatis et al., published by Cold Spring Harbor 
Laboratory, 1989. 

Reference Example 

( 1 ) Culture medium and conditions 

Escherichia, coll was aerobically cultured at 37 °C 
using LB medium (1% trypton, 0.5% yeast extract, 0.5% 
NaCl, pH 7.0). Antibiotics were each added to the medium 
at various concentrations depending on the plasmid 
retained by the Escherichia coli as follows: 25 pg/ml 
kanamycin for pFSP6, 20 pg/ml ampicillin for pXX325, 50 
pg/ml ampicillin for ampicillin-resistant ColEl type 
plasmid, and 100 pg/ml ampicillin for ampicillin-resistant 
pUC type plasmid. In the expression induction experiment, 
the culture broth obtained after cultivation until the 
stationary phase was inoculated to a fresh medium at 1%, 
then aerobically cultured at 37 "C, after which 
isopropyl-(3-D-thiogalactoside (IPTG) was added to a final 
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concen-tra-tion of 0.2 mM upon reach of an OD^^^ value of 0.6 
(6 X 10^ cells/ml), followed by further cultivation. 

(2) Construction of the system plasraid pFSP6 

The system plasmid pFSP6 was constructed with 
Escherichia coll JM109 as a host, according to the 
procedure directed in Figure 8. The plasmid pSP6-2 
[Nucleic Acids Research, Vol. 15, pp. 2653-2664 (1987)] 
was digested with Hindlll (produced by Takara Shuzo) and 
blunted at both ends, after which it was further digested 
with BaraHI (produced by Takara Shuzo) to yield an about 
2.8 kb DNA fragment containing the SP6 RNA polymerase 
gene, which fragment was mixed with the plasmid vector 
pUClB (produced by Takara Shuzo), previously digested with 
BaraHI and Hindi (produced by Takara Shuzo), for ligation 
to construct the plasmid pUCSP. Next, the plasmid 
PMJR1560 [Gene, Vol. 51, pp. 225-267 (1987)] was digested 
with Kpnl and blunted at both ends, after which it was 
further digested with PstI (produced by Takara Shuzo) to 
yield an about 1.3 kb DNA fragment containing the laclq 
gene, which was then isolated. The above plasmid pUCSP 
was digested with Hindlll and blunted at both ends, after 
which it was further digested with PstI to yield a DNA 
fragment, which was mixed with the above-described about 
1.3 kb DNA fragment for ligation to construct the plasmid 
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pUCSPlac. 

Furthermore, an about 1.5 kb DNA fragment obtained by 
PstI digestion of pUCKm [Journal of Molecular Biology, 
Vol. 147, pp. 217-226 (1981)], which contains the 
kanamycin resistance gene, was introduced into the PstI 
site of the above-described plasmid pUCSFlac to construct 
the plasmid pUCSPlacKm. The plasmid was digested with 
Aatll (produced by Toyobo), after which it was partially 
digested with Nspl (produced by Takara Shuzo) to isolate 
an about 6.5 kb DNA fragment, which was then blunted at 
both ends. 

On the other hand, the plasmid pXX325 [Proceedings of 
the National Academy of Sciences of the USA, Vol. 80, pp. 
4784-4788 (1983)] was digested with Hindlll and blunted at 
both ends, after which it was further digested with Sail 
to isolate an about 6-8 kb DNA fragment containing the 
replication origin of the minxF plasmid, which fragment 
was then mixed with the above-described about 6.5 kb DNA 
fragment for ligation to construct the plasmid pFSP6. 
Escherichia coli HBlOl was transformed with the plasmid to 
form Escherichia coli HB101/pFSP6. 

(3) Construction of multicopy model expression plasmids 

The Pgpg linker, a double-stranded oligonucleotide 
containing the minimum region of the SP6 promoter, was 
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prepared from "the two DNA strands whose base sequences are 
shown by SEQ ID NO: 15 and SEQ ID NO: 16 in the Sequence 
Listing, respectively, and was mixed with the plasmid 
pMS434 [Gene, Vol. 57, pp. 89-99 (1987)], previously 
5 digested with Xhol (produced by Takara Shuzo) and Hindlll, 
for ligation to construct the model expression plasmid 
pMSP6L inserted the above-described promoter sequence 
upstream of the p-galactosidase gene on the plasmid 
( Figure 1 ) . 

10 On the other hand, by inserting a DNA fragment 

containing the inherent SP6 promoter sequence, obtained by 
digesting pSP64 [Nucleic Acids Research, Vol. 12, pp. 
7035-7056 (1984)] with AccII and Hindlll, into the 
Xhol-Hindlll site of the above-described plasmid pMS434, 

15 the model expression plasmid pMSP6F was constructed 
(Figure 1). Furthermore, the Pspe-^iac linker, which 
contains the minimum region of the SP6 promoter and the 
lac operator region, was prepared from the two DNA strands 
whose base sequences are shown by SEQ ID NO: 17 and SEQ ID 

20 NO: 18 in the Sequence Listing, respectively, and was 
inserted between the Xhol-Hindlll site of the 
above- described plasmid pMS434 to construct the model 
expression plasmid pMSPGO (Figure 2). 

25 (4) Promoter activity of the SP6 promoter for Escherichia 
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coli RNA polymerase 

The plasmids pMSP6L and pMSPSF constructed in 
Reference Example (3) and the plasmid pMS434 as a control, 
which does not contain the SP6 promoter sequence, were 
5 each introduced into the Escherichia coli shown in Table 
1. The resulting transf ormants were each cultured by the 
method described in Reference Example ( 1 ) , after which 
each cell culture broth was harvested during the 
logarithmic growth phase. The ODjqq value of each culture 

10 broth collected was determined with a portion thereof, 
while the remaining portion was transferred to a 
pre-cooled test tube and supplemented with chloramphenicol 
to a final concentration of 100 rag/ml. With this culture 
broth as a sample, p-galactosidase activity was determined 

15 by the method described in Experiments in Molecular 

Genetics, edited by J-H. Miller, published by Cold Spring 
Harbor Laboratory, 1972. 

As shown in Table 2, Escherichia coli MC41 00 
incorporating the model expression plasmid pMSP6L or 

20 pMSP6F, both harboring a sequence containing the SP6 

promoter, exhibited similar level of low p-galactosidase 
activity, irrespective of the promoter sequence inserted. 
However, that incorporating the plasmid pMS434, which does 
not contain the SP6 promoter, exhibited only lower enzyme 

25 activity. In addition, comparing Escherichia coli MRi7 
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and MRiSO as the hosts revealed decreased p-galactosidase 
activity in MRiSO, wherein the copy number of the plasmid 
was also decreased. 

(5) Evaluation of the system plasmid pFSP6 

Each transformant prepared in Reference Example (4) 
into which the system plasmid pFSP6 was further 
introduced, was cultured by the method described in 
Reference Example (1); p-galactosidase activity was 
determined both in a non-inductive condition and in an 
inductive condition before and after IPTG addition. The 
results were shown in Table 3. All transf ormants 
exhibited p-galactosidase activity in a non-inductive 
condition at a level similar to that obtained in the 
absence of the plasmid pFSP6 shown in Table 2. 

On the other hand, with induction by IPTG addition, 
the p-galactosidase activity in the transf ormants 
incorporating the plasmids pMSP6L and pMSP6F was 18 to 32 
times compared with that obtained in a non-inductive 
condition, while there was no increase in the activity in 
the transf ormants incorporating the plasmid pMS434. 

(6) Effect of the lac operator sequence on expression 
control 

Transformants obtained by introducing the plasmids 
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pMSP6L, pMSP6F, pMSP60 and pMS434, respectively, into 
Escherichia coli MC4100 incorporating the plasmid pFSP6 
were each cultured by the method described in Reference 
Example (1), and p-galactosidase activity in a 
5 non-inductive condition was determined. The results are 
shown in Table 4. The transformant incorporating the 
plasmid pMSP60, which contains the lac operator sequence, 
still showed some p-galactosidase activity, although the 
activity level was lower than those obtained with the 
10 plasraids pMSP6L and pMSP6F. In short, expression in a 

non-inductive condition could not be completely suppressed 
simply by introduction of the lac operator sequence. 

Table 4 



Strain 


jQ - Galactosidase Activity (Non-inductive Condition) 


PMSP6F 


PMSP6L 


PMSP60 


PMS434 


MC4100(pFSP6) 


221 


234 


30 


25 



20 

Example 1 

( 1 ) Culture medium and conditions 

Escherichia coli was aerobically cultured at 37 "C 
using LB medium (1% trypton, 0-5% yeast extract, 0.5% 
25 NaCl, pH 7.0). Antibiotics were each added to the medium 
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at various concentrations depending on the plasmid 
retained by the Escherichia coli as follows: 25 iig/ml 
kanamycin for pFSP6, 50 pg/ml ampicillin for 
ampicillin-resistant ColEl type plasmid, 100 pg/ml 
ampicillin for ampicillin-resistant pUC type plasmid, , 
30 vig/n»l chloramphenicol for chloramphenicol-resistant 
runaway plasmid and 30 pg/ml ampicillin for ampicillin- 
resistant runaway plasmid. In the expression induction 
experiment, the culture broth obtained after cultivation 
until the stationary phase was inoculated to a fresh 
medium at 1%, then aerobically cultured at 37 °C, after 
which isopropyl-p-D-thiogalactoside (IPTG) was added to a 
final concentration of 0.2 mM upon reach of an OTt^oo value 
of 0.6 (6 X 10^ cells/ml), followed by further cultivation. 



(2) Construction of the runaway plasmid pHS2870 

The runaway plasmid pHS2870, which provided a basis 
for construction of the expression plasmid, was 
constructed by the procedures shown in Figure 3. The 105 
Ndel DNA fragment prepared from the two DNA strands whose 
base sequences are shown by SEQ ID NO: 3 and SEQ ID NO: 4 in 
the Sequence Listing, respectively, was mixed with the 
plasmid vector pUC19, previously digested with Ndel, for 
ligation. The plasmid pUC106A thus obtained was digested 
with PvuII and subjected to self -ligation to yield the 
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yield the plasmid pUC106AdPO. With this plasmid 
pUC106AclPO as a template, PGR was then conducted using the 
RNAIIA primer (whose base sequence shown by SEQ ID NO: 5 in 
the Sequence Listing) and the 1870 primer (whose base 
sequence shown by SEQ ID NO: 6 in the Sequence Listing), to 
yield an amplified DNA fragment, which was then digested 
with Xbal and mixed with the plasmid pSTV28, previously 
digested with Xbal, for ligation to yield the plasmid 
pHS2870. The construct of the plasmid pHS2870 is shown in 
Figure 4. 

(3) Construction o£ the runaway plasmid pCRS04 

After digestion with AccI (produced by Takara Shuzo) 
and Nspl, the above-described plasmid pHS2870 was blunted 
at both ends and subjected to self -ligation to yield the 
plasmid pCRSOl, which lacks the P15A replication origin. 
The plasmid was then digested 'with EcoRI ( produced by 
Takara Shuzo) and Xbal, after which it was blunted at both 
ends and subjected to self-ligation in the same manner as 
above to construct the plasmid pCRS02 . After an about 1.2 
kb DNA fragment obtained by digesting the plasmid pMJR1560 
with Kpnl (produced by Takara Shuzo) and PstI was blunted 
at both ends, it was mixed with the above-described 
plasmid pCRS02, previously digested with NspV and Vspl 
(both produced by Takara Shuzo) and then blunted at both 
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ends, for ligation to yield the plasmid pCRS04. The flow 
diagram of the construction of the plasmid pCRS04 is shown 
in Figure 5, and the construct of the plasmid pCRS04 shown 
in Figure 6 . 

(4) Construction of the runaway expression vector pACE601 

The above-described plasmid pCRS04, previously 
digested with Nhel (produced by Takara Shuzo) and 
subsequently blunted at both ends, was mixed with the 
Psp6~0iac EX linker, prepared from the two DNA strands whose 
base sequences are shown by SEQ ID NO: 7 and SEQ ID NO: 8 in 
the Sequence Listing, for ligation to construct the 
plasmid pACE601. The flow diagram of the construction of 
the plasmid pACE601 is shown in Figure 9. The 
construction of these plasmids were conducted with 
Escherichia coli JM109 as a host. 

(5) Construction of the runaway expression vector pACE611 

The runaway expression vector pACE601 was digested 
with Xhol-Hindlll, and the Pspg linker, consisting of the 
synthetic oligo-DNAs shown by SEQ ID NO: 15 and SEQ ID 
NO: 16 in the Sequence Listing, was inserted into that site 
to construct the runaway expression vector pACE611. The 
construct of pACE611 is shown in Figure 10. The flow 
diagram of the construction of the plasmid pACE611 is 
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shown in Figure 11. 

(6) Construc-tion of -the runaway expression vectors 
pACE701 and pACE702 
5 An about 1 kb DNA fragment obtained by digesting the 

plasmid vector pUCllS (produced by Takara Shuzo) with 
BspHi (produced by NEB) was blunted at both ends, after 
which it was mixed with the above-described plasmid 
pCRS04, previously digested with Nhel, and blunted at both 

10 ends, for ligation to construct the plasmid pCRSVO. The 
flow diagram of the construction of the plasmid pCRS70 is 
shown in Figure 12. Next, the plasmid was digested with 
Ncol (produced by Takara Shuzo) and BsaAI (produced by 
NEB), after which it was blunted at both ends and mixed 

15 with the above-described Psp6~0iac linker for ligation to 
construct two plasmids pACE701 and pACE702, which 
incorporate the linker inserted in mutually opposite 
directions. The flow diagrams of the construction of the 
plasmids pACE701 and pACE702 are shown in Figure 13. The 

20 construction of these plasmids were conducted with 
Escherichia coll JM109 as a host. 



25 



(7) Construction of model runaway expression plasmids 

PGR was conducted using the primers trpA-N-NcoI and 
lacZ-C-NcoI with the above-described plasmid pMS434 as a 
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template, to yield a DNA fragment containing the 
p-galactosidase gene. The base sequences of the primers 
trpA-N-NcoI and lacZ-C-NcoI are shown by SEQ ID NO: 9 and 
SEQ ID NO: 10 in the Sequence Listing, respectively. After 
5 digestion with Ncol, the fragment was mixed with each of 

the above-described plasmids pACE601, pACE701 and pACE702, 
all previously digested with Ncol, for ligation to yield 
the model runaway expression plasmids pACE601Z, pACE701Z 
and pACE702Z, all incorporating the p-galactosidase gene 
10 introduced downstream of the Psps-Oi^e sequence. The 
construction of these plasmids were conducted with 
Escherichia coli JM109 as a host. 



( 8 ) Evaluation of the expression levels of runaway 
15 expression plasmid in a non-inductive condition 

The transformants MC4100/pFSP6/pACE6QlZ, 
iyiC4100/pFSP6/pACE701Z and MC4100/pFSP6/pACE702Z, which 
were obtained after introducing the above-described model 
runaway expression plasmids pACE601Z, pACE701Z and 
20 pACE702Z, respectively into Escherichia coli MC4100 

incorporating the above-described system plasmid pFSP6 
(hereinafter referred to as MC4100/PFSP6 ) , were each 
cultured under the conditions described in Example ( 1 ) ; 
each culture broth collected during the logarithmic growth 
25 phase was assayed for p-galactosidase activity by the 
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method described in Reference Example ( 4 ) . In all 
■transformants examined, p-galactosidase activity was below 
the detection limit, demonstrating a greater 
expression-suppressing effect than that obtained with the 
5 multicopy plasmids shown in Reference Example (4). 

(9) Construction of the Nsp7524 III restriction 
endonuclease gene expression plasmid 
The plasmid pBRN3, which contains the Nsp7524 III 

10 restriction modification system gene, was prepared from 
Escherichia coll MCl06l/pBRN3 (ferm bp-5741). pgr was 
conducted using the primers L-ORF and NspR-0RF3 with this 
plasmid as a template to yield an about 1 kb DNA fragment 
containing the Nsp7524 III restriction endonuclease gene 

15 alone. The base sequences of the primers L-ORF and 

NspR-0RF3 are shown by SEQ ID NO: 11 and SEQ ID NO: 12 in 
the Sequence Listing, respectively. After digestion with 
Ncol, the fragment was mixed with the above-described 
plasmid pACE601, previously digested with Ncol, for 

20 ligation to construct the plasmid pACE601-NspIII, which 

incorporates the Nsp7524 III restriction enzyme gene alone 
introduced downstream of the PsP6~0iac sequence. The plasmid 
was stably retained in Escherichia coli JM109 not 
containing the Nsp7524 III modification enzyme gene. 



25 
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(10) Construction of Nsp7524 III restriction endonuclease 
gene expression system 

The transformants HB101/pFSP6/pACE601-NspIII and 
HBlOl/pFSP6/pACE601, which were obtained after introducing 
5 the above-described plasmid pACE601-NspIII , which contains 
the Nsp7524 III restriction endonuclease gene, and the 
control plasmid pACE601 into Escherichia coll HBlOl 
incorporating the above-described system plasmid pFSP6 
(hereinafter referred to as HB101/PFSP6 ) , were each 

10 cultured in LB medium until the stationary phase; each 
culture broth collected was inoculated to two tubes of 
fresh medium at 1% and aerobically cultured at 37 "C Upon 
reach of an ODgoo value of 0.6, IPTG was added to one of 
the tubes to a final concentration of 0.2 mM, followed by 

15 further cultivation. After completion of the cultivation, 
cells were harvested, suspended in cell disruption buffer 
A (20 mM Tris-HCl, pH 7.5, 10 mM 2-mercaptoethanol ) , and 
disrupted by ultrasonication, followed by centrif ugation 
to yield a crude extract. 

20 A 1 yil portion of this crude extract was added to 30 

Vil of a reaction mixture (10 mM Tris-HCl, pH 7.5, 10 mM 
MgClj, 1 mM DTT, 50 mM NaCl, 1 pg X-DNA) and reacted at 
37 °C for 1 hour, after which the reaction mixture was 
subjected to agarose gel electrophoresis to examine for 

25 the digestion of X-DNA and confirm restriction 
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endonuclease activity. As shown in Figure 14, restriction 
endonuclease activity was observed only in 

HB101/pFSP6/pACE601-NspIII with expression induced by the 
addition of IPTG, and its X,-DNA cleavage pattern agreed 
5 with that of Aval, an isoschizomer of Nsp7524 III. 

The results of the above-described X-DNA digestion 
reaction as conducted for extended periods of 2 and 3 
hours are shown in Figure 15. In this case, because the 
crude extract used in the present experiment contained 

10 nuclease derived from the host Escherichia coli, the DNA 
fragment resulting from Nsp7524 III activity underwent 
further degradation by Escherichia coli nuclease, 
resulting in bands whose density decreased as the increase 
in reaction time, on lanes 8 through 10, with no bands 

15 detected on lane 10. On lanes 11 through 13, the Nsp7524 
III digestion fragment was not produced because of the 
absence of Nsp7524 III induction even when reaction time 
was extended, demonstrating that the ?^-DNA was made into 
the lower molecular weight fragments by the direct action 

20 of Escherichia coli nuclease. No restriction endonuclease 
activity was detected in the crude extract from 
HB101/pFSP6/pACEA601 . 

Example 2 

25 Isolation and expression of the AccIII restriction 
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endonuclease gene 

(1) Determination of N-terminal amino acid sequence of 

AccIII restriction endonuclease protein and synthesis 
of primer DNA corresponding to the amino acid 
5 sequence 

About 2 ku of a commercial product of the AccIII 
restriction endonuclease (produced by Takara Shuzo) was 
subjected to gel filtration using a column of Sephacryl 
S300 (produced by Pharmacia) to determine the molecular 
10 weight of the AccIII restriction endonuclease. Judging 

from the elution position where activity was detected, the 
molecular weight of the AccIII restriction endonuclease 
was proven to be about 70,000. Because most restriction 
endonuclease proteins are dimers, the AccIII restriction 
15 endonuclease protein was expected to be mobilized to a 
position for a molecular weight of about 35,000 in SDS 
polyacrylamide gel electrophoresis. 

Next, to obtain a sample for N-terminal amino acid 
sequencing of the AccIII restriction endonuclease protein, 
20 about 2 ku of a commercial product of the AccIII 
restriction endonuclease, together with a protein 
molecular weight marker, was subjected to SDS 
polyacrylamide gel electrophoresis, after which the 
protein was transferred from the gel to a PVDF membrane 
25 and stained with bromophenol blue to confirm the protein 
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position. After destaining with 10% acetic acid-50% 
methanol, the portion of the PVDF membrane where a protein 
of about 35,000 molecular weight was blotted was cut out 
and subjected to automatic Edman degradation using the 
5 protein sequencer GIOOOA (produced by Hewlett-Packard) to 
determine the N-terminal amino acid sequence shown by SEQ 
ID NO: 19 in the Sequence Listing. On the basis of this 
sequence, AccIII primers 1 and 2, shown by SEQ ID NO: 20 
and SEQ ID NO: 21 in the Sequence Listing, respectively, 
10 were then synthesized for use as a pair of cassette 
primers. 

( 2 ) Preparation of genomic DNA of Acc bacterium 

In accordance with the method of Kita et al . , 

15 described in Nucleic Acids Research, Vol. 13, pp. 

8685-8694 (1985), the Acc bacterium was cultured to obtain 
wet cells. Two grams of the wet cells obtained was 
suspended in 10 ml of buffer B [25 mM Tris-HCl (pH 8.0), 
50 mM glucose, 10 mM EDTA] , stirred in the presence of 1 

20 ml of a lysozyme solution prepared to 2 mg/ml in buffer B, 
and kept standing at 37 °C for 20 minutes. Next, 28 ml of 
buffer C [100 mM NaCl, 100 mM Tris-HCl (pH 8.0)] was added 
to this solution, followed by stirring. One milliliter of 
a proteinase K solution prepared to 20 mg/ml in TE [10 mM 

25 Tris-HCl, 1 mM EDTA (pH 8.0)] and 4 ml of a 10% SOS 
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solution were further added, followed by stirring, after 
which the mixed solution was kept standing at 37 "C for 1 
hour. 

To this solution, 6 ml of a 5 M aqueous solution of 
NaCl and 6 ml of buffer D [10% CTAB (cetyl trimethyl 
ammonium bromide), 0.7 M NaCl] was added, followed by 
stirring, after which the mixed solution was kept standing 
at 60 °C for 20 minutes. This solution was treated with 
phenol and subsequently with chloroform, after which the 
water layer was separated. To the water layer, 50 \il of 
an RNaseA (produced by Sigma) solution prepared to 10 
mg/ml in TE was added, followed by stirring, after which 
the mixed solution was kept standing at 37 °C for 40 
minutes. After being kept standing, this solution was 
treated with phenol and subsequently with chloroform, 
after which the water layer was separated. To the water 
layer, an equal volume of cold ethanol was added; the DNA 
precipitated was recovered by winding around a glass 
capillary. The DNA was washed with 70% ethanol and 
dissolved in 3 ml of TE to yield about 200 pg of genomic 
DNA. 

(3) Preparation of Acc genomic cassette library 

The following procedures were conducted basically in 
accordance with the method described on pages F16-F17 in 
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"Gene Engineering Guide" (1995-1996 edition), Takara 
Shuzo . 

The genomic DNA obtained in ( 2 ) was completely 
digested with EcoRI (produced by Takara Shuzo); the DNA 
5 fragment obtained was ligated with an EcoRI cassette 
( produced by Takara Shuzo ) , having a cohesive end 
complementary thereto, to yield an EcoRI cassette library. 
Similarly, the genomic DNA was completely digested 
separately with the restriction endonucleases Bglll, 

10 ECOT14I, ECOT22I, Hindlll, PstI, Sail and Xbal (all 

produced by Takara Shuzo ) . The genomic DNA fragments 
obtained were each bound to each of several cassettes 
( produced by Takara Shuzo ) , having a complementary 
protruding end, to yield the Bglll, EcoT14l, EcoT22l, 

15 Hindlll, PstI, Sail and Xbal cassette libraries, 

respectively. These cassette libraries are generically 
referred to as the Acc genomic cassette library. 

(4) Analysis of the AccIII restriction endonuclease gene 
20 A PCR-based DNA amplification reaction was carried 

out using a primer pair of AccIII primer 1 and cassette 
primer CI (produced by Takara Shuzo) with the EcoRI 
cassette library obtained in (3) as a template. To 
efficiently and specifically amplify the desired region, a 
25 second PCR-based DNA amplification reaction was carried 
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out using a primer pair of AccIII primer 2 and cassette 
primer C2 (produced by Takara Shuzo) with a portion of the 
reaction liquid obtained, to yield an amplified DNA 
fragment. Similarly, with the Bglll, EcoT14I, EcoT22I, 
5 Hindlll, PstI, Sail and Xbal cassette libraries as a 

template respectively, the above two-step PCR-based DNA 
amplification reaction was carried out separatedly to 
yield amplified DNA fragments- Each amplified DNA was 
analyzed by agarose gel electrophoresis; the amplified DNA 

10 fragment derived from the Xbal cassette library showed a 
particularly high amplification efficiency. 

With this in mind, a PCR-based DNA amplification 
reaction was again carried out using a primer pair of 
AccIII primer 1 and cassette primer CI with the Xbal 

15 cassette library as a template. This reaction mixture was 
subjected to agarose gel electrophoresis; an about 0.5 kb 
amplified DNA fragment was recovered from the gel. A 
second PCR-based DNA amplification reaction was carried 
out using a primer pair of AccIII primer 2 and cassette 

20 primer C2 with the DNA fragment obtained as a template. 

Base sequencing of the about 0.5 kb amplified DNA fragment 
obtained demonstrated that the fragment encodes a portion 
of the protein whose N-terminal amino acid sequence was 
determined in ( 1 ) . On the other hand, the putative 

25 molecular weight of the AccIII restriction endonuclease 
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protein is about 35,000, and the gene encoding the protein 
is assumed to be about 1 kb in length. It was therefore 
expected that the base sequence of the AccIII restriction 
endonuclease gene could be determined, provided that 
information on the accurate base sequence of the 
5 '-terminal region to which AccIII primers 1 and 2 
annealed, and information on the remaining about 0.5 kb 
base sequence in the 3' region, were available, in 
addition to the above-described information on the about 
0.5 kb base sequence. 

With this in mind, to determine the accurate base 
sequence of the 5 ' -terminal region of the AccIII 
restriction endonuclease gene region, AcclII primer 3, 
shown by SEQ ID NO: 22 in the Sequence Listing, and AccIII 
primer 4, shown by SEQ ID NO: 23 in the Sequence Listing, 
were synthesized. Next, a PCR-based DNA amplification 
reaction was carried out using a primer pair of AccIII 
primer 4 and cassette primer CI with the EcoRI cassette 
library obtained in (3) as a template. A PCR-based DNA 
amplification reaction was carried out using a primer pair 
of AccIII primer 3 and cassette primer C2 with a portion 
of this reaction mixture. Similarly, with the Bglll, 
ECOT14I, ECOT22I, Hindlll, PstI, Sail and Xbal cassette 
libraries as a template respectively, the above two-step 
PCR-based DNA amplification reaction was separatedly 
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carr-ied out. Of "the DNA fragments amplified, the 
Shortest, i.e., the about 1 kb DNA fragment was obtained 
with the ECOT14I cassette library as a template, the base 
sequence thereof was determined. 
5 Furthermore, to determine the about 0.5 kb sequence 

on the 3 ' side of the gene region assumed to encode the 
AccIII restriction endonuclease protein, AccIII primer 5, 
shown by SEQ ID NO: 24 in the Sequence Listing, and AccIII 
primer 6, shown by SEQ ID NO: 25 in the Sequence Listing, 
10 were synthesized. A PCR-based DNA amplification reaction 
was carried out using a primer pair of AccIII primer 5 and 
cassette primer CI with the EcoRI cassette library 
obtained in (3) as a template. A PCR-based DNA 
amplification reaction was carried out using a primer pair 
15 of AccIII primer 6 and cassette primer C2 with a portion 
of this reaction mixture. Similarly, with the Bglll, 
ECOT14I, ECOT22I, Hindlll, PstI, Sail and Xbal cassette 
libraries as a template respectively, the above two-step 
PCR-based DNA amplification reaction was carried out 
20 separatedly. 

Of the DNA fragments amplified, the about 0.5 kb and 
about 0.8 kb DNA fragments each obtained with the EcoT22l 
and Hindi I I cassette libraries as a template respectively 
were subjected to a base sequencing. Combining the 
25 results for the three base sequences determined, the base 
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sequence information on an about 1 . 6 kb DNA fragment was 
obtained. Its base sequence is shown by SEQ ID NO: 26 in 
the Sequence Listing. Furthermore, searching for an open 
reading frame (ORF) capable of encoding the protein 
demonstrated the presence of ORFl at base numbers 558 . 
through 1442, a portion thought to be ORF2 at base numbers 
1588 through 1434, and a portion thought to be ORES at 
base numbers 1 through 535. Encoding the protein whose 
N-terminal amino acid sequence was determined in (1), ORFl 
was deemed to be the AccIII restriction endonuclease gene. 
The base sequence of the AccIII restriction endonuclease 
gene is shown by SEQ ID NO: 2 in the Sequence Listing, 
wherein the fourth base, as counted from the 5' terminus, 
is C. The amino acid sequence deduced from the base 
sequence is shown by SEQ ID N0:1 in the Sequence Listing, 
wherein the second amino acid, as counted from the N 
terminus, is Leu. The direction of translation of ORF2 is 
opposite that of ORFl and 0RF3 . 

(5) Construction of the plasmid pCRA19 

Next, a plasmid for expression of the AccIII 
restriction endonuclease gene was constructed. First, to 
obtain the gene, the primer Acc-RL, shown by SEQ ID NO: 27 
in the Sequence Listing, and the primer Acc-RR, shown by 
SEQ ID NO: 28 in the Sequence Listing, were synthesized. A 
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restriction endonuclease Ncol recogni-tion sequence site 
was introduced into both primers . Using the above 
described primer pair, it is possible to cut out the 
AccIII restriction endonuclease gene from an amplified DNA 
5 fragment obtained by PGR with the genomic DNA of the Acc 
bacterium as a template, using the restriction 
endonuclease Ncol, and the translation codon frames 
coincide with each other under control of the SP6 promoter 
when the gene is inserted into the Ncol site of the 

10 pACE611 vector. On the other hand, a use of this primer 

pair results in replacement of the fourth base, as counted 
from the 5' terminus of the AccIII restriction 
endonuclease gene, from C to G, and of the second amino 
acid, as counted from the N terminus of the protein 

15 encoded by the gene, from Leu to Val. Using the above 
primer pair with the genomic DNA of the Acc bacterium 
obtained in (2) as a template, a PCR-based DNA 
amplification reaction was carried out. This DNA fragment 
was completely digested with the restriction endonuclease 

20 Ncol ( produced by Takara Shuzo ) , after which it was 

subjected to agarose gel electrophoresis; a DNA fragment 
of about 900 bp size was recovered. Next, this DNA 
fragment was inserted into the Ncol site of the pACE611 
vector so that it was located downstream of the SP6 

25 promoter. 
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This recombinant DNA was introduced into Escherichia 
coll JM109 to yield trans formant s . Thirty transf ormants 
were randomly selected, and each inoculated to 5 ml of an 
LB medium containing 30 pg/ml chloramphenicol and cultured 
5 at aye. When the ODggg value o£ the cell culture broth 

reached 0-6, IPTG was added to a final concentration of 2 
mM to increase the plasmid copy number, followed by 
further cultivation at 37 °C for 2 hours, after which 
plasmid DNA was prepared from each culture. Each plasmid 

10 DNA obtained was simultaneously cleaved with the 

restriction endonucleases Hindi I I and Xbal, followed by 
confirmation of the length of the resulting DNA fragments 
by agarose gel electrophoresis; a plasmid thought to 
contain the gene region inserted in the right direction 

15 was detected. This plasmid, designated pCRA19, was 
introduced into Escherichia coll JM109 to yield a 
transformant, which was designated Escherichia coll 
JM109/PCRA19. 



20 (6) Construction of expression system for the AccIII 
restriction endonuclease gene 

First, the SP6 RNA polymerase gene was introduced 
into Escherichia coll JM109 to construct a phage vector 
allowing further expression of the gene. Specifically, by 
25 inserting an SP6 RNA polymerase gene fragment obtained by 
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BamHI-Hindlll digestion of pSP6-2 into the BamHI-Hindlll 
site within the multicloning site of a commercial product 
of the phage vector M13mpl8 (produced by Takara Shuzo), 
the SP6 RNA polymerase expression phage M13sp6 was 
constructed . 

Next, the trans forman-t Escherichia coll JM109/pCRA19 
was inoculated to 5 ml of an LB medium containing 30 pg/ml 
Chloramphenicol and cultured at 37 °C. When the ODgoo value 
of the cell culture broth reached 0.6, IPTG was added to a 
final concentration of 2 mM to increase the plasmid copy 
number, followed by further cultivation at 37 °C for 2 
hours. These cells were then infected with the phage 
M13sp6 to express the protein encoded by the AccIII 
restriction endonuclease gene, followed by further 
cultivation at 37 °C for 16 hours. A 11 mg portion of the 
wet cells obtained was suspended in 180 pi of cell 
disruption buffer E [20 mM Tris-HCl (pH 7.5), 10 mM 
2-mercaptoethanol] , after which the cells were disrupted 
by ultrasonication, followed by centrifugation (18000 g, 
10 minutes) to separate the solid and liquid. 

Determining the activity of the supernatant under the 
activity determination conditions shown on the data sheet 
attached to the AccIII restriction endonuclease produced 
by Takara Shuzo demonstrated the production of the AccIII 
restriction endonuclease in an amount of about 8000 units 
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per gram of wet cells, a level about 16 times that 
obtained with the Acc bacterium, per unit weight of wet 
cells. Neither activity of restriction endonucleases 
other than AccIII nor AccIII modification enzyme activity 
was noted in the supernatant obtained. Regarding the 
AccIII restriction endonuclease gene inserted into the 
plasmid pCRA19, it was demonstrated that the fourth base, 
as counted from the translation initiation base, was 
replaced from C to G, upon DNA amplification by PGR, 
resulting in the replacement of the second amino acid, as 
counted from the N-terminus of the translated protein, 
from Leu to Val, and that the protein possesses AccIII 
restriction endonuclease activity. 

An isolation/mass production system for the AccIII 
restr-ic-tion endonuclease gene in the absence of AccIII 
modification enzyme was thus developed. 

(7) Isolation of the AccIII modification enzyme gene 

Modification enzyme genes and restriction 
endonuclease genes are often closely located. With this 
in mind, to determine the 0RF2 region deduced in (4) 
above, the base sequence of an about 1.4 kb DNA fragment 
obtained using the EcoRI cassette library as a template, 
out of the amplified DNA fragments prepared to determine 
the 3' region of the AccIII restriction endonuclease gene 
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in (4) above, was determined. Next, to determine the 0RF3 
region deduced in (4) above, the base sequence of an about 
2.2 kb DNA fragment obtained using the Bglll cassette 
library as a template, out of the amplified DNA fragments 
prepared to determine the 5 ' -terminal region of the AccIII 
restriction endonuclease gene in (4) above, was 
determined. Combining these base sequences and the base 
sequence of the about 1.6 kb DNA fragment containing the 
AccIII restriction endonuclease gene region determined in 
(4) resulted in the information on the about 4.2 kb base 
sequence shown by SEQ ID NO: 29 in the Sequence Listing. 
The AccIII restriction endonuclease gene was located at 
base numbers 1913 through 2797, 0RF2 at base numbers 3712 
through 2789, and 0RF3 at base numbers 691 through 1890. 
Of these ORFs, ORF2 proved to contain a portion highly 
homologous to the conserved region among modification 
enzymes in a known restriction modification system. 

Next, to obtain the 0RF2 region, an about 1 . 1 kb 
0RF2 -containing portion was cut out using EcoT22l and Hpal 
(produced by Takara Shuzo) from an amplified DNA fragment 
obtained by two-step PGR using a primer pair of Acc primer 
6 and cassette primer CI and another primer pair of Acc 
primer 5 and cassette primer C2 in the respective steps, 
with the EcoRI cassette library as a template, and was 
inserted into the Smal site downstream of the lac promoter 
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in the pUCllS vector. If this recombinant plasmid 
contains the AccIII modification enzyme gene^ and if the 
AccIII modification enzyme can be expressed in Escherichia 
coli, the DNA in the culture of the transformant obtained 
5 by introducing this recombinant plasmid into Escherichia 
coli JM109 would undergo methylation by the AccIII 
modification enzyme and acquire resistance to cleavage by 
the AccIII restriction endonuclease . 

Because the pUCllS vector used to construct this 
10 recombinant plasmid has no AccIII restriction endonuclease 
recognition sequence, however, it is inappropriate to use 
this recombinant plasmid by itself to confirm AccIII 
modification enzyme activity. On the other hand, there is 
an AccIII restriction endonuclease recognition sequence in 
15 the plasmid pSTV29, which can be co-present with this 

recombinant plasmid in Escherichia coli JM109. With this 
in mind, to utilize pSTV29 as an index of expression of 
the AccIII modification enzyme, the above recombinant 
plasmid and pSTV29 were both introduced into Escherichia 
20 coli JM109 to yield transf ormants . Three transf ormants 
v/ere each cultured at 37 °C for 16 hours in 2 ml of an LB 
medium containing 100 pg/ml ampicillin, 30 iig/ml 
chloramphenicol and 2 mM IPTG, after which plasmid DNA was 
prepared from each culture. 
25 The DNA thus prepared is available as a mixture of 
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pSTV29 and the above-described recombinant plasmid. When 
each DNA sample was subjected to a digestion with the 
AccIII restriction endonuclease, the pSTV29 in all samples 
exhibited resistance to the AccIII restriction 
endonuclease activity, demonstrating the insertion of the 
AccIII modification enzyme gene into the recombinant 
plasmid contained in the DNA sample. Furthermore, when 
the DNA sample was simultaneously cleaved with -the 
restriction endonucleases Hindlll and Xbal, followed by 
analysis of the length of the resulting DNA fragments by 
agarose gel electrophoresis, the presence of 0RF2 in the 
recombinant plasmid was confirmed. 0RF2 was thus proven 
to be the AccIII modification enzyme gene. The base 
sequence of the AccIII modification enzyme gene obtained 
and the amino acid sequence deduced therefrom are shown by 
SEQ ID NO: 14 and SEQ ID NO: 13 in the Sequence Listing, 
respectively . 

The structure of the AccIII restriction modification 
system gene demonstrated according to the present 
invention is shown in Figure 16, wherein M, R and the 
arrow represent the modification enzyme gene, the 
restriction endonuclease gene, and the orientation of ORF, 
respectively . 



INDUSTRIAL APPLICABILITY 
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The present invention provides a plasmid vector 
capable of introducing into a host an exogenous desired 
gene encoding a protein lethal or harmful to the host, and 
a method for being capable of efficiently expressing the 
5 protein using the plasmid vector for the first time. A 
method for being capable of isolating a restriction 
endonuclease gene which constitutes a restriction 
modification system without co-existence of a modification 
enzyme gene, which has been difficult in the prior arts, 
10 is also provided. Furthermore, an AcclII restriction 

endonuclease gene and an AccIII modification enzyme gene 
are isolated by the present invention, and, from 
Escherichia coll transformed with the plasmid containing 
the gene, it is possible to easily obtain an AccIII 
15 restriction endonuclease or an AccIII modification enzyme 
available in the genetic engineering at a desired purity 
of an enzyme preparation compared to the prior method for 
purification . 
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SEQUENCE LISTING 



SEQ ID N0:1 
SEQUENCE LENGTH: 295 
SEQUENCE TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 2 (Val or Leu) 

SEQUENCE DESCRIPTION: 

Met Xaa Pro Leu Asp Lys Asp Leu Gin Lys Ala Lys lie Ser lie 
15 10 15 

Thr Asp Phe Phe Glu lie Thr Asn Arg Val Leu Asp Tyr Phe Pro 
20 25 30 

Asn Val lie Asn Asn Thr Val Glu Lys Gly Asp Tyr Leu lie Ser 
35 40 45 

Ser Ser Asn lie Ala Gly Thr He Lys Phe Leu Arg Pro He Asn 
50 55 60 

Arg Lys Leu Phe He Gin Glu Lys Lys Val Phe Asn Asp Tyr Phe 
65 70 75 

Gin Lys Leu He He Val Phe Glu A^n He Arg Asn Lys Lys Thr 
80 85 90 

Val Thr Glu Glu Asp Lys He He He Asp Arg Val He Tyr Thr 
95 100 105 

He Gin Gin Ser He Gly He Gly Leu Asp Leu Met Val Asn Gin 
110 115 120 

Asn Ser Ala Arg Lys His Val Gly Asn Arg Phe Glu Glu Leu He 
125 130 135 

Arg Val He Phe Thr Glu He Ser Val Ser Asn Lys Arg Thr Val 
140 145 150 

Leu Gin He Pro Tyr Glu Thr Asp Glu Gly Gin Lys He Tyr Lys 
155 160 165 

Cys Glu Asn Asp Leu He He Ser Pro Phe Glu Asn Val Glu Ser 
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170 175 180 

Thr Asn Lys His Leu Asp Glu Asn Glu lie Val Val Ser lie Lys 

185 190 195 

Thr Thr Ser Lys Asp Arg Met Gly Lys Met Phe lie Asp Lys lie 

200 205 210 

Leu Leu Glu Arg Phe Val Lys His Pro Gin Lys Val lie Gly lie 

215 220 225 

Phe Leu Asn Asp Val Gin Arg Lys Glu Asp Asn Asn lie Ser Phe 

230 235 240 

Thr Leu Val Ser Gly Leu Phe Met Val Tyr Thr Lys Phe Leu Thr 

245 250 255 

Thr Leu Glu Gly lie Tyr Tyr Leu Asp Pro Pro Pro Asn Ala Leu 

260 265 270 

Lys Leu Pro Tyr Ser Asn His Met Lys Arg Phe Ser Asp Leu lie 

275 280 285 

Thr Glu Asp Leu Glu Lys Leu Phe Ser Ser 

290 295 



SEQ ID NO: 2 

SEQUENCE LENGTH: 885 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULE TYPE: Genomic DNA 

SEQUENCE DESCRIPTION: 

ATGSTACCAC TGGATAAAGA TTTACAAAAA GCAAAGATTT CAATTACTGA TTTTTTTGAA 60 
ATTACAAATA GAGTTTTAGA TTATTTCCCC AATGTAATCA ATAATACAGT TGAAAAAGGA 120 
GATTATTTAA TATCCTCATC AAATATTGCT GGAACAATAA AATTCCTAAG ACCAATCAAT 180 
AGAAAGTTAT TTATTCAGGA AAAAAAAGTT TTCAATGATT ATTTTCAAAA ACTGATTATA 240 
GTTTTTGAAA ATATAAGGAA CAAAAAAACT GTAACAGAGG AAGATAAAAT TATTATTGAT 300 
AGGGTAATTT ACACAATACA GCAATCTATT GGAATTGGTT TAGATTTAAT GGTTAATCAA 360 
AATAGTGCTA GAAAGCACGT TGGTAACCGA TTTGAAGAAT TAATTAGAGT CATTTTTACA 420 
GAAATATCAG TATCGAATAA AAGAACTGTA TTACAAATTC CATATGAAAC TGATGAAGGA 480 
CAGAAAATTT ACAAATGCGA GAATGACCTC ATTATTTCTC CTTTTGAAAA TGTAGAATCT 540 
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ACAAACAAAC ATCTAGATGA AAATGAGATT GTTGTTTCAA TAAAGACAAC ATCAAAAGAT 600 

AGGATGGGAA AAATGTTTAT AGATAAAATT TTACTTGAAA GGTTTGTTAA ACACCCTCAA 660 

AAAGTTATAG GGATTTTCCT CAATGATGTA CAAAGAAAAG AAGACAACAA TATCAGCTTT 720 

ACACTTGTTT CAGGATTATT TATGGTGTAT ACTAAATTCT TAACTACTCT TGAAGGGATC 780 

TATTATTTAG ATCCACCACC TAATGCATTG AAACTACCAT ATTCTAATCA TATGAAAAGA 840 

TTTTCAGATT TAATTACAGA AGACCTTGAA AAATTATTCT CCTCT 885 



SEQ ID NO: 3 
SEQUENCE LENGTH: 215 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TATGGATATG TTCATAAACA CGCATGTAGG CAGATAGATC TTTGGTTGTG AATCGCAACC 60 
AGTGGCCTTA TGGCAGGAGC CGCGGATCAC CTACCATCCC TAATGACCTG CAGGCATGCA 120 
AGCTTGCATG CCTGCAGGTC ATTAGGTACG GCAGGTGTGC TCGAGGCGAA GGAGTGCCTG 180 
CATGCGTTTC TCCTTGGCTT TTTTCCTCTG GGACA 215 

SEQ ID NO: 4 
SEQUENCE LENGTH: 215 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TATGTCCCAG AGGAAAAAAG CCAAGGAGAA ACGCATGCAG GCACTCCTTC GCCTCGAGCA 60 
CACCTGCCGT ACCTAATGAC CTGCAGGCAT GCAAGCTTGC ATGCCTGCAG GTCATTAGGG 120 
ATGGTAGGTG ATCCGCGGCT CCTGCCATAA GGCCACTGGT TGCGATTCAC AACCAAAGAT 180 
CTATCTGCCT ACATGCGTGT TTATGAACAT ATCCA 215 



SEQ ID NO: 5 
SEQUENCE LENGTH: 28 
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SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

AGATCTAGAG CAAACAAAAA AACCACCG 28 

SEQ ID NO: 6 
SEQUENCE LENGTH: 24 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GGTCTAGATC CCAGAGGAAA AAAG 24 

SEQ ID NO: 7 
SEQUENCE LENGTH: 100 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

CTCGAGATTT AGGTGACACT ATAGAATACG GAATTGTGAG CGGATAACAA TTCCAAGCTT 60 
CACAGGAAAC AGACCATGGC TTAAGTAACT AGTGAATTCG 100 

SEQ ID NO: 8 
SEQUENCE LENGTH: 100 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

CGAATTCACT AGTTACTTAA GCCATGGTCT GTTTCCTGTG AAGCTTGGAA TTGTTATCCG 60 
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CTCACAATTC CGTATTCTAT AGTGTCACCT AAATCTCGAG 

SEQ ID NO: 9 
SEQUENCE LENGTH: 27 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid ( synthe-tic DNA) 
SEQUENCE DESCRIPTION: 
AATCCCATGG AACGCTACGA ATCTCTG 

SEQ ID NO: 10 
SEQUENCE LENGTH: 29 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 
CCGGCCATGG TTATTTTTGA CACCAGACC 

SEQ ID NO: 11 
SEQUENCE LENGTH: 26 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 
TAACTTGAAT CCATGGGTTC TCACCG 

SEQ ID NO: 12 
SEQUENCE LENGTH: 29 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
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MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 
TACTCAGTAG CCATGGCTCT CATAGACCG 



SEQ ID NO: 13 
SEQUENCE LENGTH: 308 
SEQUENCE TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Asn Glu He Ala Phe Asp Asn Tyr Ser Tyr He Pro Lys Leu 
15 10 15 

Lys Leu Tyr Ser Glu He Glu Leu Lys Pro Phe Phe He Ser Lys 
20 25 30 

Asn Gly Ser Leu Phe Asn Val Asp Ala He Asp Phe Leu Arg Lys 
35 40 45 

Leu Glu Ser Asn Ser Val Asp Leu He Phe Ala Asp Pro Pro Tyr 
50 55 60 

Asn He Lys Lys Ala Glu Trp Asp He Phe Ser Ser Gin Asn Glu 
65 70 75 

Tyr Leu Glu Trp Ser Lys Glu Trp He Met Glu Ala His Arg Val 
80 ' 85 90 

Leu Lys Asp Asn Gly Ser Leu Tyr Val Cys Gly Phe Ser Glu He 
95 100 105 

Leu Ala Asp He Lys Phe He Thr Ser Lys Tyr Phe His Ser Cys 
HQ 115 120 

Lys Trp Leu He Trp Phe Tyr Arg Asn Lys Ala Asn Leu Gly Lys 
125 130 135 

Asp Trp Gly Arg Ser His Glu Ser He Leu Leu Leu Arg Lys Ser 
140 145 150 

Lys Asn Phe He Phe Asn He Asp Glu Ala Arg He Pro Tyr Asn 
155 160 165 

Glu His Thr Val Lys Tyr Pro Gin Arg Thr Gin Ala Glu Ser Ser 
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170 175 180 

Gin Tyr Ser Asn Ser Lys Lys Gin Tyr He Trp Glu Pro Asn Pro 

185 190 195 

Leu Gly Ala Lys Pro Lys Asp Val Leu Glu He Pro Thr He Ser 

200 205 210 

Asn Gly Ser Trp Glu Arg Ser He His Pro Thr Gin Lys Pro Val 

215 220 225 

Glu Leu Leu Lys Lys He He Leu Ser Ser Ser Asn Lys Asp Ser 

230 235 240 

Leu He Leu Asp Pro Phe Gly Gly Ser Gly Thr Thr Tyr Ala Val 

245 250 255 

Ala Glu Ala Phe Gly Arg Lys Trp He Gly Thr Glu Leu Asp Lys 

260 265 270 
Asn Tyr Cys Leu Glu He Gin Lys Arg Leu Lys Asp Glu Ser Met 

275 280 285 
He Asn Arg He Phe Ser Gly Asp Asp Asp Ser Asn Ser Gin Asn 

290 295 300 
Arg Arg Lys Lys Leu Arg Gly Glu 
305 



SEQ ID NO: 14 

SEQUENCE LENGTH: 924 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 

MOLECULE TYPE: Genomic DNA 

SEQUENCE DESCRIPTION: 

GTGAATGAAA TAGCGTTTGA TAATTACAGT TATATACCAA AATTAAAACT TTATTCGGAA 60 
ATCGAGCTTA AACCATTTTT TATTTCAAAA AACGGTTCAC TTTTCAATGT TGATGCTATT 120 
GATTTTTTAA GAAAATTAGA GAGTAATTCT GTGGATTTAA TTTTTGCAGA TCCACCTTAT 180 
AACATTAAAA AGGCAGAGTG GGATATTTTT TCTTCTCAAA ATGAATATCT CGAATGGAGT 240 
AAAGAATGGA TAATGGAAGC TCATAGAGTT TTAAAAGATA ATGGCAGTTT ATATGTTTGT 300 
GGCTTTTCAG AAATTCTGGC AGACATAAAA TTTATCACTT CAAAATATTT TCACAGTTGT 360 
AAATGGTTGA TTTGGTTCTA TAGAAACAAG GCAAATTTAG GTAAAGATTG GGGACGTTCA 420 
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CACGAAAGTA TACTGTTATT AAGAAAATCT AAAAATTTTA TTTTTAATAT TGATGAGGCA 480 

CGAATCCCGT ATAATGAGCA TACAGTTAAA TATCCACAAA GAACCCAGGC CGAATCTTCG 540 

CAATATTCGA ACTCAAAAAA GCAATATATT TGGGAGCCAA ACCCATTAGG AGCTAAGCCA 600 

AAAGATGTTT TGGAGATTCC CACAATTTCA AATGGTTCTT GGGAAAGAAG TATTCACCCT 650 

ACGCAAAAGC CAGTAGAATT GCTTAAAAAA ATAATTTTAT CTTCATCTAA TAAAGATAGT 720 

TTAATTCTTG ATCCATTTGG TGGTTCGGGA ACTACATATG CTGTTGCGGA AGCTTTTGGC 780 

AGAAAATGGA TTGGAACAGA GTTAGATAAA AATTATTGTC TGGAAATTCA AAAGCGATTG 840 

AAAGACGAAA GTATGATCAA CAGGATTTTT TCAGGCGATG ATGATTCAAA TTCTCAAAAT 900 

AGAAGAAAAA AATTAAGAGG AGAA 924 

SEQ ID NO: 15 
SEQUENCE LENGTH: 29 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TCGAGATTTA GGTGACACTA TAGAATACA 2 9 

SEQ ID NO: 16 
SEQUENCE LENGTH: 29 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single ' 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

AGCTTGTATT CTATAGTGTC ACCTAAATC 2 5 

SEQ ID NO: 17 
SEQUENCE LENGTH: 54 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
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SEQUENCE DESCRIPTION: 

TCGAGATTTA GGTGACACTA TAGAATACGG AATTGTGAGC GGATAACAAT TCCA 

SEQ ID NO: 18 
SEQUENCE LENGTH: 54 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY : 1 inear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

AGCTTGGAAT TGTTATCCGC TCACAATTCC GTATTCTATA GTGTCACCTA AATC 

SEQ ID NO: 19 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Leu Pro Leu Asp Lys Asp Leu Gin Lys Ala Lys lie Ser lie 
15 10 15 

Thr Asp Phe Phe Glu 

20 ' 

SEQ ID NO: 20 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
FEATURE: 6, 9, 12 (inosine) 
SEQUENCE DESCRIPTION: 
ATGTTNCCNY TNGAYAARGA YYT 
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SEQ ID NO: 21 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
FEATURE: 9 (inosine) 
SEQUENCE DESCRIPTION: 
AAGGATTTNC ARAARGCNAA RAT 

SEQ ID NO: 22 
SEQUENCE LENGTH: 30 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 
TAAATCTAAA CCAATTCCAA TAGATTGCTG 

SEQ ID NO: 23 
SEQUENCE LENGTH: 30 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 

SEQUENCE DESCRIPTION: 
CAAATCGGTT ACCAACGTGC TTTCTAGCAC 

SEQ ID NO: 24 
SEQUENCE LENGTH: 30 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
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SEQUENCE DESCRIPTION: 

GAACTGTATT ACAAATTCCA T ATGAAACTG 3 0 

SEQ ID NO: 25 
SEQUENCE LENGTH: 30 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE : Other nucleic acid ( syn-thetic DNA ) 
SEQUENCE DESCRIPTION: 

GACAGAAAAT TTACAAATGC GAGAATGACC 30 

SEQ ID NO: 26 

SEQUENCE LENGTH: 1588 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 

MOLECULE TYPE: Genomic DNA 

SEQUENCE DESCRIPTION: 

CCATGGCACA CGTTTCAAAA AAGAAATCCT CGAAGTCAAA TATGATGAGA AAAACATCTC 60 

AGACATCCTG CATATGACGG TGGATGAAGC ATTGGAATTT TTCTCGGAAA ATCACGAAGA 120 

AAAAATTGTA ACCAAACTAA AACCTTTGCA GGACGTTGGT TTGGGTTATC TTCAGTTAGG 180 

CCAGTCCTCC TCTACTCTTT CCGGCGGTGA AGCCCAAAGA GTGAAGCTCG CCTCTTTCCT 240 

TGTGAAAGGT GTAACGACGG AAAAAACGTT ATTTGTTTTT GATGAACCAT CAACAGGATT 300 

ACATTTCCAC GACATTCAAA AATTACTGAA ATCACTTCAG GCACTGATAG AATTAGGGCA 360 

TTCGGTTGTA GTGATTGAGC ATCAGCCGGA TATTATCAAA TGCGCCGATT ACATCATCGA 420 

TGTCGGACCC AATGCCGGAA AATACGGTGG CGAAATTGTT TTCACAGGAA CTCCGGAAGA 480 

TTTGGTAAAA GAGAAAAAGT CGTTTACAGG GAAGTATATT AAGGAGAAGT TAAAGTAATT 540 

TATTTATATT TGAAGTTATG CTACCACTGG ATAAAGATTT ACAAAAAGCA AAGATTTCAA 600 

TTACTGATTT TTTTGAAATT ACAAATAGAG TTTTAGATTA TTTCCCCAAT GTAATCAATA 560 

ATACAGTTGA AAAAGGAGAT TATTTAATAT CCTCATCAAA TATTGCTGGA ACAATAAAAT 720 

TCCTAAGACC AATCAATAGA AAGTTATTTA TTCAGGAAAA AAAAGTTTTC AATGATTATT 780 

TTCAAAAACT GATTATAGTT TTTGAAAATA TAAGGAACAA AAAAACTGTA ACAGAGGAAG 840 

ATAAAATTAT TATTGATAGG GTAATTTACA CAATACAGCA ATCTATTGGA ATTGGTTTAG 900 
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ATTTAATGGT TAATCAAAAT AGTGCTAGAA AGCACGTTGG TAACCGATTT GAAGAATTAA 960 
TTAGAGTCAT TTTTACAGAA ATATCAGTAT CGAATAAAAG AACTGTATTA CAAATTCCAT 1020 
ATGAAACTGA TGAAGGACAG AAAATTTACA AATGCGAGAA TGACCTCATT ATTTCTCCTT 1080 
TTGAAAATGT AGAATCTACA AACAAACATC TAGATGAAAA TGAGATTGTT GTTTCAATAA 1140 
AGACAACATC AAAAGATAGG ATGGGAAAAA TGTTTATAGA TAAAATTTTA CTTGAAAGGT 1200 
TTGTTAAACA CCCTCAAAAA GTTATAGGGA TTTTCCTCAA TGATGTACAA AGAAAAGAAG 1260 
ACAACAATAT CAGCTTTACA CTTGTTTCAG GATTATTTAT GGTGTATACT AAATTCTTAA 1320 
CTACTCTTGA AGGGATCTAT TATTTAGATC CACCACCTAA TGCATTGAAA CTACCATATT 1380 
CTAATCATAT GAAAAGATTT TCAGATTTAA TTACAGAAGA CCTTGAAAAA TTATTCTCCT 1440 
CTTAATTTTT TTCTTCTATT TTGAGAATTT GAATCATCAT CGCCTGAAAA AATCCTGTTG 1500 
ATCATACTTT CGTCTTTCAA TCGCTTTTGA ATTTCCAGAC AATAATTTTT ATCTAACTCT 1560 
GTTCCAATCC ATTTTCTGCC AAAAGCTT 1588 

SEQ ID NO: 27 
SEQUENCE LENGTH: 25 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

ATATTTGAAG CCATGGTACC ACTGG 25 

SEQ ID NO: 28 ' 
SEQUENCE LENGTH: 26 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: Oyther nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GATGATTCAA ATTCTCACCA TGGAAG 2 6 

SEQ ID NO: 29 

SEQUENCE LENGTH: 4146 

SEQUENCE TYPE: nucleic acid 
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STRANDEDNESS ; double 
TOPOLOGY: linear 
MOLECULE TYPE: Genomic DNA 
SEQUENCE DESCRIPTION: 

AGATCTGGTC ATCCCAAACA AAAATCTTTC GGTTTACGAA GATGCAGTCG CTTCCTGGAA 60 
AGGCGAAAGT ATGAGCGAAT GGAAAAAGGA ATTCATCAAA AAAGCCAAAG ATTTCCCAAT 120 
TCACAAGCCT TATCATCAAC TCACAAAAGA GCAGAAACAG TTCCTTTGGA AAGGCGATAA 180 
AACCAGAAGT TTCCCAAGTA TTGATAATTT TTTCAAAATG CTTGAAGAGA ATCTTTACAA 240 
GATCCAATAC CGCGTAATGC TTTCGCGCTA TCGTGGGAAA ACACTTTGCC CCGATTGCGA 300 
AGGATTACGA TTGCGGGAAG AAACAAGCTG GGTGAAGATT GACGGACACA ACATTCAGTC 360 
TTTGATTGAA TTACCTTTGG ATGAACTCCT GCCATTGATC AAAAGCTTAA AACTGAACGT 420 
CCACGACAGA GAAATTGCCA AACGCCTGAC TTACGAAATC GAAACGAGAT TAGAATTCCT 480 
GACGAAAGTC GGCCTTGGAT ATCTGACTTT GAACCGAACA TCCAACACGC TTTCCGGAGG 540 
AGAAAGCCAG AGAATCAATC TGGCGACAGC TTGGGAAGTT CGCTGGTTGG TTCTATTTAT 600 
ATTTTGGATG AGCCGAGCAT TGGTCTGCAT TCCCGCGATA CAGAAAATCT GATTGGTGTC 660 
CTCAAACAAC TCCGCGATTT GGGAANTACC GTGATTGTTG TAGAACACGA CGAAGATGTG 720 
ATGCTTGCGG CAGNTTACAT TATAGATATT GGCCCNGNAG CGGGCTACCT TGGTGGCGAT 780 
CTTGTTTTCA GCGNGGATTA TAAAGAGATG CTGAAGTNTN ATACTTTAAC CGCAAAATAC 840 
CTGAATGGCG AACTGAAAAT AGAAGTTCCT GAAAAACGAA GAAAACCGAA GGAATTCATC 900 
GCAATAAAAG GTGCCCGCCA GAATAATTTA AAAAATATTG ACGTTGATGT TCCGTTAGAA 960 
TGTCTGACAG TTATCACAGG CGTTTCTGGA AGCGGGAAAT CCACTTTGAT GAAGGAAGTG 1020 
ATGACCAATG CCATCCAGAT CCAACTGGGA ATGGGCGGCA AAAAAGCCGA TTACGATTCG 1080 
GTGGAATTCC CGAAAAAGCT GATCCAGAAT ATCGAACTGA TTGACCAGAA CCCAATCGGG 1140 
AAATCGTCCC GCTCCAACCC CGTGACTTAT CTGAAAGCTT ACGACGATAT CCGGGATCTT 1200 
TTTGCGAAAC AAAAATCCGC AAAAATCCAG GGTTACAAAC CGAAGCATTT CTCCTTCAAT 1260 
GTGGATGGCG GAAGATGTGA CGAGTGCAAA GGCGAAGGTA TCATTACCGT ATCAATGCAG 1320 
TTTATGGCGG ACATCGAGCT GGAGTGTGAG CATTGCCATG GCACACGTTT CAAAAAAGAA 1380 
ATCCTCGAAG TCAAATATGA TGAGAAAAAC ATCTCAGACA TCCTGCATAT GACGGTGGAT 1440 
GAAGCATTGG AATTTTTCTC GGAAAATCAC GAAGAAAAAA TTGTAACCAA ACTAAAACCT 1500 
TTGCAGGACG TTGGTTTGGG TTATCTTCAG TTAGGCCAGT CCTCCTCTAC TCTTTCCGGC 1560 
GGTGAAGCCC AAAGAGTGAA GCTCGCCTCT TTCCTTGTGA AAGGTGTAAC GACGGAAAAA 1620 
ACGTTATTTG TTTTTGATGA ACCATCAACA GGATTACATT TCCACGACAT TCAAAAATTA 1680 
CTGAAATCAC TTCAGGCACT GATAGAATTA GGGCATTCGG TTGTAGTGAT TGAGCATCAG 1740 
CCGGATATTA TCAAATGCGC CGATTACATC ATCGATGTCG GACCCAATGC CGGAAAATAC 1800 
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GGTGGCGAAA TTGTTTTCAC AGGAACTCCG GAAGATTTGG TAAAAGAGAA AAAGTCGTTT 1860 
ACAGGGAAGT ATATTAAGGA GAAGTTAAAG TAATTTATTT ATATTTGAAG TTATGCTACC 1920 
ACTGGATAAA GATTTACAAA AAGCAAAGAT TTCAATTACT GATTTTTTTG AAATTACAAA 1980 
TAGAGTTTTA GATTATTTCC CCAATGTAAT CAATAATACA GTTGAAAAAG GAGATTATTT 2040 
AATATCCTCA TCAAATATTG CTGGAACAAT AAAATTCCTA AGACCAATCA ATAGAAAGTT 2100 
ATTTATTCAG GAAAAAAAAG TTTTCAATGA TTATTTTCAA AAACTGATTA TAGTTTTTGA 2160 
AAATATAAGG AACAAAAAAA CTGTAACAGA GGAAGATAAA ATTATTATTG ATAGGGTAAT 2220 
TTACACAATA CAGCAATCTA TTGGAATTGG TTTAGATTTA ATGGTTAATC AAAATAGTGC 2280 
TAGAAAGCAC GTTGGTAACC GATTTGAAGA ATTAATTAGA GTCATTTTTA CAGAAATATC 2340 
AGTATCGAAT AAAAGAACTG TATTACAAAT TCCATATGAA ACTGATGAAG GACAGAAAAT 2400 
TTACAAATGC GAGAATGACC TCATTATTTC TCCTTTTGAA AATGTAGAAT CTACAAACAA 2460 
ACATCTAGAT GAAAATGAGA TTGTTGTTTC AATAAAGACA ACATCAAAAG ATAGGATGGG 2520 
AAAAATGTTT ATAGATAAAA TTTTACTTGA AAGGTTTGTT AAACACCCTC AAAAAGTTAT 2580 
AGGGATTTTC CTCAATGATG TACAAAGAAA AGAAGACAAC AATATCAGCT TTACACTTGT 2640 
TTCAGGATTA TTTATGGTGT ATACTAAATT CTTAACTACT CTTGAAGGGA TCTATTATTT 2700 
AGATCCACCA CCTAATGCAT TGAAACTACC ATATTCTAAT CATATGAAAA GATTTTCAGA 2760 
TTTAATTACA GAAGACCTTG AAAAATTATT CTCCTCTTAA TTTTTTTCTT CTATTTTGAG 2820 
AATTTGAATC ATCATCGCCT GAAAAAATCC TGTTGATCAT ACTTTCGTCT TTCAATCGCT 2880 
TTTGAATTTC CAGACAATAA TTTTTATCTA ACTCTGTTCC AATCCATTTT CTGCCAAAAG 2940 
CTTCCGCAAC AGCATATGTA GTTCCCGAAC CACCAAATGG ATCAAGAATT AAACTATCTT 3000 
TATTAGATGA AGATAAAATT ATTTTTTTAA GCAATTCTAC TGGCTTTTGC GTAGGGTGAA 3060 
TACTTCTTTC CCAAGAACCA TTTGAAATTG TGGGAATCTC CAAAACATCT TTTGGCTTAG 3120 
CTCCTAATGG GTTTGGCTCC CAAATATATT GCTTTTTTGA GTTCGAATAT TGCGAAGATT 3180 
CGGCCTGGGT TCTTTGTGGA TATTTAACTG TATGCTCATT ATACGGGATT CGTGCCTCAT 3240 
CAATATTAAA AATAAAATTT TTAGATTTTC TTAATAACAG TATACTTTCG TGTGAACGTC 3300 
CCCAATCTTT ACCTAAATTT GCCTTGTTTC TATAGAACCA AATCAACCAT TTACAACTGT 3360 
GAAAATATTT TGAAGTGATA AATTTTATGT CTGCCAGAAT TTCTGAAAAG CCACAAACAT 3420 
ATAAACTGCC ATTATCTTTT AAAACTCTAT GAGCTTCCAT TATCCATTCT TTACTCCATT 3480 
CGAGATATTC ATTTTGAGAA GAAAAAATAT CCCACTCTGC CTTTTTAATG TTATAAGGTG 3540 
GATCTGCAAA AATTAAATCC ACAGAATTAC TCTCTAATTT TCTTAAAAAA TCAATAGCAT 3600 
CAACATTGAA AAGTGAACCG TTTTTTGAAA TAAAAAATGG TTTAAGCTCG ATTTCCGAAT 3660 
AAAGTTTTAA TTTTGGTATA TAACTGTAAT TATCAAACGC TATTTCATTC ACAAATGAAT 3720 
CAATCTGCTG TTGTGTATAA ACCCTGTAAT TATTAATAGG ATGTCTTAAA CTTTTGAATT 3780 
TTCCAGAATT ATCCCATCTT CCTTAATGTC TCAGAGTTAA CATCTAATAA TTTCGCCGCT 3840 
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TCTTTTATTG ATAAATAATC ATCCATATCT TACACAACAT TACACAAGTT TATACAGCAA 3900 
ATATAAATAT TTTTTATACA TTGTAAAAAT TTTATTTACT TTTATTTTGT TCAATTGTCT 3960 
CAATAAATAG TTAATCGAAA TACATTTTGA ATATGATAAA ATTGACTCCA ACAAATCTAA 4020 
CACAATGACA TTAAAACCAA TAAAAACGGA AGAAGATTAC AATCAGGTTT TAGAAAGACT 4080 
TTCACAAATT TTCGACGCTA AACCAAATAC CAAAGATGGA GATGAATTGG GAAATCTTGG 4140 



SEQ ID NO: 30 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: Genomic DNA 
SEQUENCE DESCRIPTION: 
ATTTAGGTGA CACTATAGAA TAG 
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CLAIMS 

1. A plasmid vector characterized by comprising a 
promoter sequence to control an expression of a desired 
5 gene, said promoter sequence being recognized by an RNA 
polymerase not inherent to a host, and a replication 
origin for increasing a copy number by induction with an 
exogenous factor'. 

10 >'2. The plasmid vector according to claim 1, wherein said 
promoter sequence is recognized by RNA polymerases derived 
from bacteriophages. 

/3. The plasmid vector according to claim 2, wherein said 
15 promoter sequence is recognized by an RNA polymerase 
derived from SP6 phage. 

4. The plasmid vector according to claim 3, wherein said 
promoter sequence contains the base sequence of SEQ ID 

20 NO: 30 set forth in the Sequence Listing. 

5. The plasmid vector according to any one of claims 1 
to 4, wherein said replication origin is under control of 
a promoter. 

25 
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6. The plasmid vector according to any one of claims 1 
to 5, wherein said replication origin is under control of 
the lac promoter - 

5 7 . The plasmid vector according to any one of claims , 1 
to 6, comprising a drug resistance gene as a selection 
marker . 

8. The plasmid vector according to claim 7, which is 
10 selected from pACE601, pACE611, pACE701 and pACE702. 

9 . A plasmid vector in which a desired gene to be 
expressed is incorporated into the plasmid vector 
according to any one of claims 1 to 8. 

15 

10. A method for expressing a desired gene, characterized 
by introducing into a host a plasmid vector in which the 
desired gene is incorporated into the plasmid vector 
according to any one of claims 1 to 8, and an RNA 

20 polymerase gene which recognizes a promoter sequence in 
said plasmid vector, and inducing an increase in a copy 
number of said plasmid vector and an expression of said 
RNA polymerase by using an exogenous factor to transcribe 
and translate the desired gene. 

25 
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11. The method for expressing a desired gene according to 
claim 10, characterized in that the increase in the copy 
number of the plasmid vector and the expression of the RNA 
polymerase are induced by respective exogenous factors. 

12. The method for expressing a desired gene according to 
claim 10, characterized in that the increase in the copy 
number of the plasmid vector and the expression of the RNA 
polymerase are induced by a same exogenous factor. 

13. The method for expressing a desired gene according to 
any one of claims 10 to 12, wherein said exogenous factor 
which induces the increase in the copy number of the 
plasmid vector, is one or more selected from the group 
consisting of an addition of isopropyl-p-D-thiogalactoside 
(IPTG), an addition of lactose, an addition of galactose, 
an addition of arabinose, a reduction of a tryptophane 
concentration and an adjustment of a transformant 
cultivation temperature. 

14. The method for expressing the desired gene according 
to any one of claims 10 to 12, wherein said exogenous 
factor which induces the expression of the RNA polymerase, 
is one or more selected from the group consisting of an 
addition of isopropyl-p-D-thiogalactoside (IPTG), an 
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addition of lactose, an addition of galactose, an addition 
of arabinose, a reduction of a tryptophane concentration 
and an adjustment of a transformant cultivation 
temperature . 

5 

15. The method for expressing a desired gene according to 
claim 10, characterized in that said RNA polymerase gene 
is in-troduced into the host by the other plasmid vector or 
a phage vector. 

10 

16. The method for expressing a desired gene according to 
claim 10, characterized in that said RNA polymerase gene 
is incorporated into a chromosome of the host. 

15 17. The method for expressing a desired gene according to 
claim 15 or 15, characterized in that said RNA polymerase 
gene is derived from SP6 phage. 

18. The method for expressing a desired gene according to 
20 any one of claims 10 to 17, wherein said desired gene 

encodes a protein lethal or harmful to the host. 

19. The method for expressing a desired gene according to 
any one of claims 10 to 18, characterized in that 

25 Escherichia coli is used as the host. 
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20. A method for isolating a desired gene, characterized 
in that the plasmid vector according to any one of claims 
1 to 8 is employed in the method for isolating the desired 
gene. 

21. The method for isolating a desired gene according to 
claim 20, wherein said desired gene encodes a protein 
lethal or harmful to a host. 

22. The method for isolating a desired gene according to 
claim 21, wherein the gene encoding a protein lethal or 
harmful to the host is a restriction endonuclease gene. 

23. A polypeptide containing the entire or a portion of 
the amino acid sequence shown by SEQ ID N0:1 set forth in 
the Sequence Listing, and possessing an activity of AccIII 
restriction endonuclease . 

24. A polypeptide having an amino acid sequence resulting 
from at least one of deletion, addition, insertion or 
substitution of one or more amino acid residues in the 
amino acid sequence of SEQ ID NO:l set forth in the 
Sequence Listing or a portion thereof, and possessing an 
activity of AccIII restriction endonuclease. 
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25. A DNA encoding a polypeptide which contains the 
entire or a portion of the amino acid sequence shown by 
SEQ ID N0:1 set forth in the Sequence Listing, and 
possesses an activity of AccIII restriction endonuclease. 

26. A DNA containing the entire or a portion of the DNA 
shown by SEQ ID NO: 2 set forth in the Sequence Listing 
wherein an expression product of said DNA possesses an 
activity of AccIII restriction endonuclease. 

27. A DNA encoding a polypeptide resulting from at least 
one of deletion, addition, insertion or substitution of 
one or more amino acid residues in the amino acid sequence 
of SEQ ID N0:1 set forth in the Sequence Listing or a 
portion thereof, and possessing an activity of AccIII 
restriction endonuclease. 

28. A DNA capable of hybridizing to the DNA according to 
any one of claims 25 to 27, and encoding a polypeptide 
possessing an activity of AccIII restriction endonuclease. 
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ABSTRACT 



A plasmid vector characterized by comprising a 
promoter sequence that can be recognized by an RNA 
polymerase which is not inherent in a host and that 
controls the expression of desired genes and a replication 
origin that increases the number of copies under the 
induction by exogenous factors; methods for expression and 
isolation of target genes by using the vector; a 
polypeptide having the activity of an AccIII restriction 
endonuclease; and a DNA encoding the polypeptide. The 
invention provides for the first time a plasmid vector 
which can introduce an exogenous desired gene encoding 
proteins which are lethal or harmful to hosts into the 
hosts, a method for efficiently expressing the proteins by 
using the vector, and also a method for permitting a 
restriction endonuclease gene constituting a restriction- 
modification system to be isolated even in the absence of 
a modification enzyme gene, which has been difficult in 
the prior arts . 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: SAGAWA, HIROAKI 
UENO, HARUMl 
OSHIMA, ATSUSHI 
KATO, IKUNOSHIN 

{ii) TITLE OF INVENTION: PLASMID 

(iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: BIRCH, STEWART, KOLASCH & BIRCH, LLP 

(B) STREET: PO BOX 747 

(C) CITY: FALLS CHURCH 

(D) STATE: VA 

(E) COUNTRY: USA 

(F) ZIP: 22040-0747 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: WEINER, MARC S. 

(B) REGISTRATION NUMBER: 32,181 

(C) REFERENCE/DOCKET NUMBER: 1422-0319P 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 703-205-8000 
{B) TELEFAX: 703-205-8050 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "2=Val or Leu" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
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Met Xaa Pro Leu Asp Lys Asp Leu Gin Lys Ala Lys lie Ser lie Thr 
15 10 15 

Asp Phe Phe Glu lie Thr Asn Arg Val Leu Asp Tyr Phe Pro Asn Val 
20 25 30 

lie Asn Asn Thr Val Glu Lys Gly Asp Tyr Leu lie Ser Ser Ser Asn 
35 40 45 

lie Ala Gly Thr lie Lys Phe Leu Arg Pro lie Asn Arg Lys Leu Phe 



lie Gin Glu Lys Lys Val Phe Asn Asp Tyr Phe Gin Lys Leu He He 
65 70 75 80 

Val Phe Glu Asn He Arg Asn Lys Lys Thr Val Thr Glu Glu Asp Lys 
85 90 95 

He He He Asp Arg Val He Tyr Thr He Gin Gin Ser He Gly He 
100 105 110 

Gly Leu Asp Leu Met Val Asn Gin Asn Ser Ala Arg Lys His Val Gly 
115 120 125 

Asn Arg Phe Glu Glu Leu He Arg Val He Phe Thr Glu He Ser Val 
130 135 ■ 140 

Ser Asn Lys Arg Thr Val Leu Gin He Pro Tyr Glu Thr Asp Glu Gly 
145 150 155 160 

Gin Lys lie Tyr Lys Cys Glu Asn Asp Leu He He Ser Pro Phe Glu 
165 170 175 

Asn Val Glu Ser Thr Asn Lys His Leu Asp Glu Asn Glu He Val Val 
180 185 190 

Ser He Lys Thr Thr Ser Lys Asp Arg Met Gly Lys Met Phe He Asp 
195 200 205 

Lys He Leu Leu Glu Arg Phe Val Lys His Pro Gin Lys Val He Gly 
210 215 220 

He Phe Leu Asn Asp Val Gin Arg Lys Glu Asp Asn Asn He Ser Phe 
225 230 235 240 

Thr Leu Val Ser Gly Leu Phe Met Val Tyr Thr Lys Phe Leu Thr Thr 
245 250 255 

Leu Glu Gly He Tyr Tyr Leu Asp Pro Pro Pro Asn Ala Leu Lys Leu 
260 265 270 

Pro Tyr Ser Asn His Met Lys Arg Phe Ser Asp Leu He Thr Glu Asp 
275 280 285 



Leu Glu Lys Leu Phe Ser Ser 
290 295 



INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ATGSTACCAC TGGATAAAGA TTTACAAAAA GCAAAGATTT CAATTACTGA TTTTTTTGAA 60 

ATTACAAATA GAGTTTTAGA TTATTTCCCC AATGTAATCA ATAATACAGT TGAAAAAGGA 120 

GATTATTTAA TATCCTCATC AAATATTGCT GGAACAATAA AATTCCTAAG ACCAATCAAT 180 

AGAAAGTTAT TTATTCAGGA AAAAAAAGTT TTCAATGATT ATTTTCAAAA ACTGATTATA 24 0 

GTTTTTGAAA ATATAAGGAA CAAAAAAACT GTAACAGAGG AAGATAAAAT TATTATTGAT 300 

AGGGTAATTT ACACAATACA GCAATCTATT GGAATTGGTT TAGATTTAAT GGTTAATCAA 360 

AATAGTGCTA GAAAGCACGT TGGTAACCGA TTTGAAGAAT TAATTAGAGT CATTTTTACA 420 

GAAATATCAG TATCGAATAA AAGAACTGTA TTACAAATTC CATATGAAAC TGATGAAGGA 4 80 

CAGAAAATTT ACAAATGCGA GAATGACCTC ATTATTTCTC CTTTTGAAAA TGTAGAATCT 540 

ACAAACAAAC ATCTAGATGA AAATGAGATT GTTGTTTCAA TAAAGACAAC ATCAAAAGAT 600 

AGGATGGGAA AAATGTTTAT AGATAAAATT TTACTTGAAA GGTTTGTTAA ACACCCTCAA 660 

AAAGTTATAG GGATTTTCCT CAATGATGTA CAAAGAAAAG AAGACAACAA TATCAGCTTT 7 20 

ACACTTGTTT CAGGATTATT TATGGTGTAT ACTAAATTCT TAACTACTCT TGAAGGGATC 7 80 

TATTATTTAG ATCCACCACC TAATGCATTG AAACTACCAT ATTCTAATCA TATGAAAAGA 840 

TTTTCAGATT TAATTACAGA AGACCTTGAA AAATTATTCT CCTCT 885 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TATGGATATG TTCATAAACA CGCATGTAGG CAGATAGATC TTTGGTTGTG AATCGCAACC 60 
AGTGGCCTTA TGGCAGGAGC CGCGGATCAC CTACCATCCC TAATGACCTG CAGGCATGCA 120 
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AGCTTGCATG CCTGCAGGTC ATTAGGTACG GCAGGTGTGC TCGAGGCGAA GGAGTGCCTG 180 



CATGCGTTTC TCCTTGGCTT TTTTCCTCTG GGACA 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

TATGTCCCAG AGGAAAAAAG CCAAGGAGAA ACGCATGCAG GCACTCCTTC GCCTCGAGCA 60 

CACCTGCCGT ACCTAATGAC CTGCAGGCAT GCAAGCTTGC ATGCCTGCAG GTCATTAGGG 120 

ATGGTAGGTG ATCCGCGGCT CCTGCCATAA GGc'cACTGGT TGCGATTCAC AACCAAAGAT 180 

CTATCTGCCT ACATGCGTGT TTATGAACAT ATCCA 215 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGATCTAGAG CAAACAAAAA AACCACCG 28 
(2) INFORMATION FOR SEQ ID NO: 6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
^ GGTCTAGATC CCAGAGGAAA AAAG 24 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CTCGAGATTT AGGTGACACT ATAGAATACG GAATTGTGAG CGGATAACAA TTCCAAGCTT 60 
CACAGGAAAC AGACCATGGC TTAAGTAACT AGTGAATTCG 100 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CGAATTCACT AGTTACTTAA GCCATGGTCT GTTTCCTGTG AAGCTTGGAA TTGTTATCCG 60 
CTCACAATTC CGTATTCTAT AGTGTCACCT AAATCTCGAG 100 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
AATCCCATGG AACGCTACGA ATCTCTG 
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(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCGGCCATGG TTATTTTTGA CACCAGACC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single ^ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TAACTTGAAT CCATGGGTTC TCACCG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TACTCAGTAG CCATGGCTCT CATAGACCG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 amino acids 

(B) TYPE: amino acid 

(C) STEIANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Asn Glu lie Ala Phe Asp Asn Tyr Ser Tyr lie Pro Lys Leu Lys 
15 10 15 

Leu Tyr Ser Glu lie Glu Leu Lys Pro Phe Phe lie Ser Lys Asn Gly 
20 25 30 

Ser Leu Phe Asn Val Asp Ala lie Asp Phe Leu Arg Lys Leu Glu Ser 



Asn Ser Val Asp Leu lie Phe Ala Asp Pro Pro Tyr Asn lie Lys Lys 
50 55 60 

Ala Glu Trp Asp lie Phe Ser Ser Gin Asn Glu Tyr Leu Glu Trp Ser 
65 70 75 80 

Lys Glu Trp lie Met Glu Ala His Arg Val Leu Lys Asp Asn Gly Ser 



Leu Tyr Val Cys Gly Phe Ser Glu lie Leu Ala Asp He Lys Phe He 
100 105 110 

Thr Ser Lys Tyr Phe His Ser Cys Lys Trp Leu He Trp Phe Tyr Arg 
115 120 125 

Asn Lys Ala Asn Leu Gly Lys Asp Trp Gly Arg Ser His Glu Ser He 
130 135 140 

Leu Leu Leu Arg Lys Ser Lys Asn Phe He Phe Asn He Asp Glu Ala 
145 150 155 160 

Arg He Pro Tyr Asn Glu His Thr Val Lys Tyr Pro Gin Arg Thr Gin 
165 170 175 

Ala Glu Ser Ser Gin Tyr Ser Asn Ser Lys Lys Gin Tyr He Trp Glu 
180 185 190 

Pro Asn Pro Leu Gly Ala Lys Pro Lys Asp Val Leu Glu He Pro Thr 
195 200 205' 

He Ser Asn Gly Ser Trp Glu Arg Ser He His Pro Thr Gin Lys Pro 
. 210 215 220 

Val Glu Leu Leu Lys Lys He He Leu Ser Ser Ser Asn Lys Asp Ser 
225 230 235 240 

Leu He Leu Asp Pro Phe Gly Gly Ser Gly Thr Thr Tyr Ala Val Ala 
245 250 255 

Glu Ala Phe Gly Arg Lys Trp He Gly Thr Glu Leu Asp Lys Asn Tyr 
• 260 265 270 



Cys Leu Glu He Gin Lys Arg Leu Lys Asp Glu Ser Met He Asn Arg 



275 280 285 

lie Phe Ser Gly Asp Asp Asp Ser Asn Sex Gin Asn Arg Arg Lys Lys 
290 295 300 

Leu Arg Gly Glu 
305 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GTGAATGAAA TAGCGTTTGA TAATTACAGT TATATACCAA AATTAAAACT TTATTCGGAA 60 

ATCGAGCTTA AACCATTTTT TATTTCAAAA AAC6GTTCAC TTTTCAATGT TGATGCTATT 120 

GATTTTTTAA GAAAATTAGA GAGTAATTCT GTGGATTTAA TTTTTGCAGA TCCACCTTAT 180 

AACATTAAAA AGGCAGAGTG GGATATTTTT TCTTCTCAAA ATGAATATCT CGAATGGAGT 24 0 

AAAGAATGGA TAATGGAAGC TCATAGAGTT TTAAAAGATA ATGGCAGTTT ATATGTTTGT 300 

GGCTTTTCAG AAATTCTGGC AGACATAAAA TTTATCACTT CAAAATATTT TCACAGTTGT 3 50 

AAATGGTTGA TTTGGTTCTA TAGAAACAAG GCAAATTTAG GTAAAGATTG GGGACGTTCA 4 20 

CACGAAAGTA TACTGTTATT AAGAAAATCT AAAAATTTTA TTTTTAATAT TGATGAGGCA 4 80 

CGAATCCCGT ATAATGAGCA TACAGTTAAA TATCCACAAA GAACCCAGGC CGAATCTTCG 54 0 

CAATATTCGA ACTCAAAAAA GCAATATATT TGGGAGCCAA ACCCATTAGG AGCTAAGCCA 600 

AAAGATGTTT TGGAGATTCC CACAATTTCA AATGGTTCTT GGGAAAGAAG TATTCACCCT 650 

ACGCAAAAGC CAGTAGAATT GCTTAAAAAA ATAATTTTAT CTTCATCTAA TAAAGATAGT 720 

TTAATTCTTG ATCCATTTGG TGGTTCGGGA ACTACATATG CTGTTGCGGA AGCTTTTGGC 7 80 

AGAAAATGGA TTGGAACAGA GTTAGATAAA AATTATTGTC TGGAAATTCA AAAGCGATTG 84 0 

AAAGACGAAA GTATGATCAA CAGGATTTTT TCAGGCGATG ATGATTCAAA TTCTCAAAAT 900 

AGAAGAAAAA AATTAAGAGG AGAA 924 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



89 



(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION; /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCGAGATTTA GGTGACACTA TAGAATACA 29 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION; /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AGCTTGTATT CTATAGTGTC ACCTAAATC 29 
(2) INFORMATION FOR SEQ ID NO: 17: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TCGAGATTTA GGTGACACTA TAGAATACGG AATTGTGAGC GGATAACAAT TCCA 54 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



AGCTTGGAAT TGTTATCCGC TCACAATTCC GTATTCTATA GTGTCACCTA AATC 
(2) INFORMATION FOR SEQ ID NO: 19: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Leu Pro Leu Asp Lys Asp Leu Gin Lys Ala Lys lie Ser lie Thr 
15 10 15 

Asp Phe Phe Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: primer_bind 

(B) LOCATION: 1..23 

(D) OTHER INFORMATION: /note= "6, 9, 12 = inosine" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
ATGTTNCCNY TNGAYAARGA YYT 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 
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(A) NAME/KEY: priraer_bind 

(B) LOCATION: 1..23 

(D) OTHER INFORMATION: /note= "9 = inosine" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AAGGATTTNC ARAARGCNAA RAT 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TAAATCTAAA CCAATTCCAA TAGATTGCTG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TAAATCTAAA CCAATTCCAA TAGATTGCTG 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
, (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
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GAACTGTATT ACAAATTCCA TATGAAACTG 
{2) INFORMATION FOR SEQ ID NO: 25: 

{!■) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GACAGAAAAT TTACAAATGC GAGAATGACC 
(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



CCATGGCACA 


CGTTTCAAAA 


AAGAAATCCT 


CGAAGTCAAA 


TATGATGAGA 


AAAACATCTC 


60 


AGACATCCTG 


CATATGACGG 


TGGATGAAGC 


ATTGGAATTT 


TTCTCGGAAA 


ATCACGAAGA 


120 


AAAAATTGTA 


ACCAAACTAA 


AACCTTTGCA 


GGACGTTGGT 


TTGGGTTATC 


TTCAGTTAGG 


180 


CCAGTCCTCC 


TCTACTCTTT 


CCGGCGGTGA 


AGCCCAAAGA 


GTGAAGCTCG 


CCTCTTTCCT 


240 


TGTGAAAGGT 


GTAACGACGG 


AAAAAACGTT 


ATTTGTTTTT 


GATGAACCAT 


CAACAGGATT 


300 


ACATTTCCAC 


GACATTCAAA 


AATTACTGAA 


ATCACTTCAG 


GCACTGATAG 


AATTAGGGCA 


360 


TTCGGTTGTA 


GTGATTGAGC 


ATCAGCCGGA 


TATTATCAAA 


TGCGCCGATT 


ACATCATCGA 


420 


TGTCGGACCC 


AATGCCGGAA 


AATACGGTGG 


CGAAATTGTT 


TTCACAGGAA 


CTCCGGAAGA 


480 


TTTGGTAAAA 


GAGAAAAAGT 


CGTTTACAGG 


GAAGTATATT 


AAGGAGAAGT 


TAAAGTAATT 


540 


TATTTATATT 


TGAAGTTATG 


CTACCACTGG 


ATAAAGATTT 


ACAAAAAGCA 


AAGATTTCAA 


600 


TTACTGATTT 


TTTTGAAATT 


ACAAATAGAG 


TTTTAGATTA 


TTTCCCCAAT 


GTAATCAATA 


660 


ATACAGTTGA 


AAAAGGAGAT 


TATTTAATAT 


CCTCATCAAA 


TATTGCTGGA 


ACAATAAAAT 


720 


TCCTAAGACC 


AATCAATAGA 


AAGTTATTTA 


TTCAGGAAAA 


AAAAGTTTTC 


AATGATTATT 


780 
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TTCAAAAACT 



GATTATAGTT 



TTTGAAAATA TAAGGAACAA AAAAACTGTA ACAGAGGAAG 



840 



ATAAAATTAT 



TATTGATAGG 



GTAATTTACA CAATACAGCA ATCTATTGGA ATTGGTTTAG 



900 



ATTTAAtGGT 



TAATCAAAAT 



AGTGCTAGAA AGCACGTTGG TAACCGATTT GAAGAATTAA 



960 



TTAGAGTCAT TTTTACAGAA ATATCAGTAT CGAATAAAAG AACTGTATTA CAAATTCCAT 1020 

ATGAAACTGA TGAAGGACAG AAAATTTACA AATGCGAGAA TGACCTCATT ATTTCTCCTT 1080 

TTGAAAATGT AGAATCTACA AACAAACATC TAGATGAAAA TGAGATTGTT GTTTCAATAA 1140 

AGACAACATC AAAAGATAGG ATGGGAAAAA TGTTTATAGA TAAAATTTTA CTTGAAAGGT 1200 

TTGTTAAACA CCCTCAAAAA GTTATAGGGA TTTTCCTCAA TGATGTACAA AGAAAAGAAG 12 60 

ACAACAATAT CAGCTTTACA CTTGTTTCAG GATTATTTAT GGTGTATACT AAATTCTTAA 1320 

CTACTCTTGA AGGGATCTAT TATTTAGATC CACCACCTAA TGCATTGAAA CTACCATATT 1380 

CTAATCATAT GAAAAGATTT TCAGATTTAA TTACAGAAGA CCTTGAAAAA TTATTCTCCT 14 4 0 

CTTAATTTTT TTCTTCTATT TTGAGAATTT GAATCATCAT CGCCTGAAAA AATCCTGTTG 1500 

ATCATACTTT CGTCTTTCAA TCGCTTTTGA ATT^TCCAGAC AATAATTTTT ATCTAACTCT 1560 

GTTCCAATCC ATTTTCTGCC AAAAGCTT 158 8 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
ATATTTGAAG CCATGGTACC ACTGG 25 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
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GATGATTCAA ATTCTCACCA TGGAAG 26 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
AGATCTGGTC ATCCCAAACA AAAATCTTTC GGTTTACGAA GATGCAGTCG CTTCCTGGAA 60 

AGGCGAAAGT ATGAGCGAAT GGAAAAAGGA ATTCATCAAA AAAGCCAAAG ATTTCCCAAT 120 

TCACAAGCCT TATCATCAAC TCACAAAAGA GCAGAAACAG TTCCTTTGGA AAGGCGATAA 180 

AACCAGAAGT TTCCCAAGTA TTGATAATTT TTT^AAAATG CTTGAAGAGA ATCTTTACAA 240 

GATCCAATAC CGCGTAATGC TTTCGCGCTA TCGTGGGAAA ACACTTTGCC CCGATTGCGA 300 

AGGATTACGA TTGCGGGAAG AAACAAGCTG GGTGAAGATT GACGGACACA ACATTCAGTC 360 

TTTGATTGAA TTACCTTTGG ATGAACTCCT GCCATTGATC AAAAGCTTAA AACTGAACGT 420 

CCACGACAGA GAAATTGCCA AACGCCTGAC TTACGAAATC GAAACGAGAT TAGAATTCCT 4 80 

GACGAAAGTC GGCCTTGGAT ATCTGACTTT GAACCGAACA TCCAACACGC TTTCCGGAGG 540 

AGAAAGCCAG AGAATCAATC TGGCGACAGC TTGGGAAGTT CGCTGGTTGG TTCTATTTAT 600 

ATTTTGGATG AGCCGAGCAT TGGTCTGCAT TCCCGCGATA CAGAAAATCT GATTGGTGTC 660 

CTCAAACAAC TCCGCGATTT GGGAANTACC GTGATTGTTG TAGAACACGA CGAAGATGTG 720 

ATGCTTGCGG CAGNTTACAT TATAGATATT GGCCCNGNAG CGGGCTACCT TGGTGGCGAT 780 

CTTGTTTTCA GCGNGGATTA TAAAGAGATG CTGAAGTNTN ATACTTTAAC CGCAAAATAC 840 

CTGAATGGCG AACTGAAAAT AGAAGTTCCT GAAAAACGAA GAAAACCGAA GGAATTCATC 900 

GCAATAAAAG GTGCCCGCCA GAATAATTTA AAAAATATTG ACGTTGATGT TCCGTTAGAA 960 

TGTCTGACAG TTATCACAGG CGTTTCTGGA AGCGGGAAAT CCACTTTGAT GAAGGAAGTG 1020 

ATGACCAATG CCATCCAGAT CCAACTGGGA ATGGGCGGCA AAAAAGCCGA TTACGATTCG 1080 

GTGGAATTCC CGAAAAAGCT GATCCAGAAT ATCGAACTGA TTGACCAGAA CCCAATCGGG 114 0 

AAATCGTCCC GCTCCAACCC CGTGACTTAT CTGAAAGCTT ACGACGATAT CCGGGATCTT 1200 

TTTGCGAAAC AAAAATCCGC AAAAATCCAG GGTTACAAAC CGAAGCATTT CTCCTTCAAT 1260 

GTGGATGGCG GAAGATGTGA CGAGTGCAAA GGCGAAGGTA TCATTACCGT ATCAATGCAG 1320 
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TTTATGGCGG 


ACATCGAGCT 


GGAGTGTGAG 


CATTGCCATG 


GCACACGTTT 


CAAAAAAGAA 


1380 


ATCCTCGAAG 


TCAAATATGA 


TGAGAAAAAC 


ATCTCAGACA 


TCCTGCATAT 


GACGGTGGAT 


1440 


GAAGCATTGG 


AATTTTTCTC 


GGAAAATCAC 


GAAGAAAAAA 


TTGTAACCAA 


ACTAAAACCT 


1500 


TTGCAGGACG 


TTGGTTTGGG 


TTATCTTCAG 


TTAGGCCAGT 


CCTCCTCTAC 


TCTTTCCGGC 


1560 


GGTGAAGCCC 


AAAG AGT G AA 


GCTCGCCTCT 


TTCCTTGTGA 


AAGGTGTAAC 


GACGGAAAAA 


1620 


ACGTTATTTG 


TTTTTGATGA 


ACCATCAACA 


GGATTACATT 


TCCACGACAT 


TCAAAAATTA 


1680 




TTCAGGCACT 


GATAGAATTA 


GGGCATTCGG 


TTGTAGTGAT 


TGAGCATCAG 


1740 


CCGGATATTA 


TCAAATGCGC 


CGATTACATC 


ATCGATGTCG 


GACCCAATGC 


CGGAAAATAC 


1800 


GGTGGCGAAA 


TTGTTTTCAC 


AGGAACTCCG 


GAAGATTTGG 


TAAAAGAGAA 


AAAGTCGTTT 


1860 




AT AT TAAGGA 


GAAGT T AAAG 


TAATTTATTT 


ATATTTGAAG 


TTATGCTACC 


1920 




GATTTACAAA 


AAGCAAAGAT 


TTCAATTACT 


GATTTTTTTG 


AAATTACAAA 


1980 


TAGAGTTTTA 


GATTATTTCC 




CAATAATACA 


GTTGAAAAAG 


GAGAT TAT T T 


2040 


AATATCCTCA 


TCAAATATTG 




AAAATTCCTA 


AGACCAATCA 


AT AGAAAGT T 


2100 


ATTTATTCAG 


GAAAAAAAAG 


TTTTCAATGA 




AAACTGATTA 


TAGTTTTTGA 


2160 


AAATATAAGG 


AACAAAAAAA 


CTGTAACAGA 


GGAAGATAAA 




ATAGGGTAAT 


2220 


T T AC AC AAT A 


CAGCAATCTA 




TTTAGATTTA 


ATGGTTAATC 


AAAATAGTGC 


2280 


TAGAAAGCAC 




GATT TGAAGA 


AT T AAT TAG A 


GTCATTTTTA 


C AG AAAT AT C 


2340 


AGTATCGAAT 


AAAAGAACT G 


TAT TACAAAT 


TCCATATGAA 


ACTGATGAAG 


GACAGAAAAT 


2400 


T TACAAAT GC 


GAGAAT GACC 


TCATTATTTC 


TCCTTTTGAA 


AATGTAGAAT 


CTACAAACAA 


2460 


AC AT C TAG AT 


G AAAAT GAGA 


TTGTTGTTTC 


AATAAAGACA 


ACATCAAAAG 


ATAGGATGGG 


2520 


AAAAATGTTT 


ATAGATAAAA 


TTTTACTTGA 


AAGGTTTGTT 


AAACACCCTC 


AAAAAGTTAT 


2580 


AGGGATTTTC 


CTCAATGATG 


TACAAAGAAA 


AGAAGACAAC 


AATATCAGCT 


TTACACTTGT 


2640 


TTCAGGATTA 


TTTATGGTGT 


ATACTAAATT 


CTTAACTACT 


CTTGAAGGGA 


TCTATTATTT 


2700 


AGAT CC ACC A 


CCTAATGCAT 


TGAAACTACC 


ATATTCTAAT 


CATATGAAAA 


GATTTTCAGA 


2760 


TTTAATTACA 


GAAGACCTTG 


AAAAATTATT 


CTCCTCTTAA 


TTTTTTTCTT 


CTATTTTGAG 


2820 


AATTTGAATC 


ATCATCGCCT 


GAAAAAATCC 


TGTTGATCAT 


ACTTTCGTCT 


TTCAATCGCT 


2880 


TTTGi\ATTTC 


CAGACAATAA 


TTTTTATCTA 


ACTCTGTTCC 


AATCCATTTT 


CTGCCAAAAG 


2940 


CTTCCGCAAC 


AGCATATGTA 


GTTCCCGAAC 


CACCAAATGG 


■ ATCAAGAATT 


' AAACTATCTT 




TATTAGATGA 


AGATAAAATT 


ATTTTTTTAA 


, GCAATTCTAC 


; TGGCTTTTGC 


; GTAGGGTGAA 


3060 


TACTTCTTTC 


CCAAGAACCA 


. TTTGAAATTG 


TGGGAATCTC 


; CAAAACATCT 


' TTTGGCTTAG 


3120 
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CTCCTAATGG 


GTTTGGCTCC 


CAAATATATT 


GCTTTTTTGA 


GTTCGAATAT 


TGCGAAGATT 


3180 


CGGCCTGGGT 


TCTTTGTGGA 


TATTTAACTG 


TATGCTCATT 


ATACGGGATT 


CGTGCCTCAT 


3240 


CAATATTAAA 


AATAAAATTT 


TTAGATTTTC 


TTAATAACAG 


TATACTTTCG 


TGTGAACGTC 


3300 


CCCAATCTTT 


ACCTAAATTT 


GCCTTGTTTC 


TATAGAACCA 


AATCAACCAT 


TTACAACTGT 


3360 


GAAAATATTT 


TGAAGTGATA 


AATTTTATGT 


CTGCCAGAAT 


TTCTGAAAAG 


CCACAAACAT 


3420 


ATAAACTGCC 


ATTATCTTTT 


AAAACTCTAT 


GAGCTTCCAT 


TATCCATTCT 


TTACTCCATT 


3480 


CGAGATATTC 


ATTTTGAGAA 


GAA?y\AATAT 


CCCACTCTGC 


CTTTTTAATG 


TTATAAGGTG 


3540 


GATCTGCAAA 


AATTAAATCC 


ACAGAATTAC 


TCTCTAATTT 


TCTTAAAAAA 


TCAATAGCAT 


3600 


CAACATTGAA 


AAGTGAACCG 


TTTTTTGAAA 


TAAAAAATGG 


TTTAAGCTCG 


ATTTCCGAAT 


3660 


AAAGTTTTAA 


TTTTGGTATA 


TAACTGTAAT 


TATCAAACGC 


TATTTCATTC 


ACAAATGAAT 


3720 


CAATCTGCTG 


TTGTGTATAA 


ACCCTGTAAT 


TATTAATAGG 


ATGTCTTAAA 


CTTTTGAATT 


3780 


TTCCAGAATT 


ATCCCATCTT 


CCTTAATGTC 


TCAGAGTTAA 


CATCTAATAA 


TTTCGCCGCT 


3840 


TCTTTTATTG 


ATAAATAATC 


ATCCATATCT 


TACACAACAT 


TACACAAGTT 


TATACAGCAA 


3900 


ATATAAATAT 


TTTTTATACA 


TTGTAAAAAT 


TTTATTTACT 


TTTATTTTGT 


TCAATTGTCT 


3960 


CAATAAATAG 


TTAATCGAAA 


TACATTTTGA 


AT AT GAT AAA 


ATTGACTCCA 


ACAAATCTAA 


4020 


CACAATGACA 


TTAAAACCAA 


TAAAAACGGA 


AGAAGATTAC 


AATCAGGTTT 


TAGAAAGACT 


4080 


TTCACAAATT 


TTCGACGCTA 


AACCAAATAC 


CAAAGATGGA 


GATGAATTGG 


GAAATCTTGG 


4140 


GAATTC 












4146 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xJl) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
ATTTAGGTGA CACTATAGAA TAG 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AAGCTCGAGT CTGATGACGA AGCTTGACTG ACTGAGATCA GCTTGCAAC 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CTCGAGATTT AGGTGACACT ATAGAATACA AGCTT 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTCGAGATTT AGGTGACACT ATAGAATACG GAATTGTGAG CGGATAACAA TTCCAAGCTT 
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